skip to main content
Guest
My Research
My Account
Sign out
Sign in
This feature requires javascript
Library Search
Find Databases
Browse Search
E-Journals A-Z
E-Books A-Z
Citation Linker
Help
Language:
English
Vietnamese
This feature required javascript
This feature requires javascript
Primo Search
All Library Resources
All
Course Materials
Course Materials
Search For:
Clear Search Box
Search in:
All Library Resources
Or hit Enter to replace search target
Or select another collection:
Search in:
All Library Resources
Search in:
Print Resources
Search in:
Digital Resources
Search in:
Online E-Resources
Advanced Search
Browse Search
This feature requires javascript
Search Limited to:
Search Limited to:
Resource type
criteria input
All items
Books
Articles
Images
Audio Visual
Maps
Graduate theses
Show Results with:
criteria input
that contain my query words
with my exact phrase
starts with
Show Results with:
Search type Index
criteria input
anywhere in the record
in the title
as author/creator
in subject
Full Text
ISBN
ISSN
TOC
Keyword
Field
Show Results with:
in the title
Show Results with:
anywhere in the record
in the title
as author/creator
in subject
Full Text
ISBN
ISSN
TOC
Keyword
Field
This feature requires javascript
New statistical approaches to estimating mixture models with application in anti-cancer drug studies
DOI: 10.22024/UniKent/01.02.99500
Digital Resources/Online E-Resources
Citations
Cited by
View Online
Details
Recommendations
Reviews
Times Cited
External Links
This feature requires javascript
Actions
Add to My Research
Remove from My Research
E-mail
Print
Permalink
Citation
EasyBib
EndNote
RefWorks
Delicious
Export RIS
Export BibTeX
This feature requires javascript
Title:
New statistical approaches to estimating mixture models with application in anti-cancer drug studies
Author:
Wang, Tong
Subjects:
QA Mathematics (inc Computing science)
Description:
When confronted with applications to real data problems, it is always challenging to simultaneously deal with potential group structures, high-dimensional features and the relationship between predictors and response variables. Most of the time missing data exist across the whole dataset, which makes the problems even more tricky. Meanwhile, with the advent of big data and high-throughput technology, the dimension of the given data could easily exceed the sample size, which places the ordinary linear regression into a difficult position where the normal equation is degenerate and traditional statistical techniques cannot be used properly. Notwithstanding, generally speaking, there is only a small part of variables being informative to the needs of researchers by significantly affecting the dependent variables. To address these issues, we develop a model to realise classification, variable selection and parameter estimation simultaneously in this thesis. This model also shows flexibility and inclusiveness to datasets with missingness. Moreover, by introducing the l_{q}-norm penalty to tune the sparsity level to the specific needs of researchers, our methodology has been improved further. With the help of Bayesian Information Criterion, we can specify the number of components and degree of penalty for this modelling. After that, the uses of marginal analysis and the k-means clustering method facilitate the following application to whole datasets by realising a dimension reduction purpose. In the application to the anti-cancer drug and screened gene expression data, our methodology shows good abilities for clustering drugs into a finite number of groups and screening out the related genes which play significant roles in configuring the corresponding groups. With our specific enhancements to the model, including missingness indication and adjustable sparsity level, our methodology has the potential to be applied to a wide range of datasets in the scientific area, including but not limited to economics, finance, biology, and physics. Based on the above applications, we also propose another method to determine the number of components in a mixture model, which provides an alternative view on the clustering problem. Afterwards, we examine the inherent skewness of given data by resorting to skew normal distributions. After adaptations to the traditional skew normal density function, we successfully estimate the parameters in a skew normal distribution under different skewness scenarios. The asymptotic distributions for the MLE estimates of our skew normal distribution are also obtained with detailed proofs attached in the Appendix. Meanwhile, some intriguing asymptotic properties behind our skew normal function are discussed later in this chapter. Lastly, we propose the four-piece distribution family for skew normal mixture models to consider the group structure, which shows a good estimation accuracy in the following simulation studies. From these simulations, the above models have been verified as a complement to the existing R package mclust which is popular for handling model-based clustering, classification, and density estimation problems.
Publisher:
University of Kent
Creation Date:
2022
Language:
English
Identifier:
DOI: 10.22024/UniKent/01.02.99500
Source:
EThOS: Electronic Theses Online Service (Full Text)
This feature requires javascript
This feature requires javascript
Back to results list
This feature requires javascript
This feature requires javascript
Searching Remote Databases, Please Wait
Searching for
in
scope:(TDTS),scope:(SFX),scope:(TDT),scope:(SEN),primo_central_multiple_fe
Show me what you have so far
This feature requires javascript
This feature requires javascript