Development of a unified analysis framework for multicolor flow cytometry data based on quasi-supervised learning
tarafından
 
Köktürk Güzel, Başak Esin, author.

Başlık
Development of a unified analysis framework for multicolor flow cytometry data based on quasi-supervised learning

Yazar
Köktürk Güzel, Başak Esin, author.

Yazar Ek Girişi
Köktürk Güzel, Başak Esin, author.

Fiziksel Tanımlama
xi, 93 leaves: illustrarions, charts;+ 1 computer laser optical disc.

Özet
In this dissertation, automatic compensation and gating strategies are investigated for multi-color flow cytometry data analysis. We propose two clustering algorithms that combine the quasi-supervised learning algorithm with an expectation-maximization routine for automatic gating. The quasi-supervised learning algorithm estimates the posterior probabilities of the different cell populations at each sample in a dataset in a manner that does not involve fitting a parametric model to the data. We have developed two different binary divisive clustering algorithms based on expectation maximization with responsibility values calculated using the quasi-supervised learning algorithm instead of the probabilistic models used in conventional expectation maximization applications. Our clustering algorithms determine the number of clusters in run-time by measuring the overlap between the estimated clusters in each provisional division and comparing it with the previous one to determine whether the division is warranted or not. Since this type of clustering is indifferent to the underlying distribution of dataset, it is well suited to automatic flow cytometry gating. The second clustering algorithm improves upon the first one using a simulated annealing approach. Its iterative structure allows finding the global minimum of a cost functional that achieves the best separation point by gradually smoothing the decision regions in each iteration. Finally, we have developed a joint diagonalization and clustering method for automatic compensation of flow data based on the methods above. The proposed method identifies cell sub groups using the annealing-based model-free expectation-maximization algorithm and finds a data transformation matrix that achieves orthogonality of the covariance structure of each identified cell cluster using fast Frobenius diagonalization. We have tested all proposed algortihms on both synthetically created datasets and real multi-color flow cytometry datasets. The results show that our automated gating algorithms are very successful in identifying the distinct cell groups so long as there is enough statistical evidence for their presence. In addition, the automated compensation procedure was also successfully applied on the synthetically created dataset and real multi-color flow cytometry data of lymphocytes that are a low autofluorescence cell group. However, the automated compensation algorithm needs further study to be generalized to high autofluorescence cell types where proper compensation does not necessarily coincide with an orthogonal covariance structure.

Konu Başlığı
Flow cytometry.
 
Supervised learning (Machine learning).
 
Hierarchical clustering (Cluster analysis).

Yazar Ek Girişi
Karaçalı, Bilge,

Tüzel Kişi Ek Girişi
İzmir Institute of Technology. Electronics and Communication Engineering.

Tek Biçim Eser Adı
Thesis (Doctoral)--İzmir Institute of Technology:Electronics and Communication Engineering.
 
İzmir Institute of Technology: Electronics and Communication Engineering--Thesis (Doctoral).

Elektronik Erişim
Access to Electronic Versiyon.


LibraryMateryal TürüDemirbaş NumarasıYer NumarasıDurumu/İade Tarihi
IYTE LibraryTezT001642QH585.5.F56 K79 2017Tez Koleksiyonu