Converting data from multi-instance to single-instance representations using p-Order Laplacian projections
dc.contributor.advisor | Wang, Hua | |
dc.contributor.author | Elbeleidy, Saad | |
dc.date.accessioned | 2018-06-20T16:16:18Z | |
dc.date.accessioned | 2022-02-03T13:11:25Z | |
dc.date.available | 2019-06-19T16:16:18Z | |
dc.date.available | 2022-02-03T13:11:25Z | |
dc.date.issued | 2018 | |
dc.identifier | Elbeleidy_mines_0052N_11550.pdf | |
dc.identifier | T 8545 | |
dc.identifier.uri | https://hdl.handle.net/11124/172409 | |
dc.description | Includes bibliographical references. | |
dc.description | 2018 Summer. | |
dc.description.abstract | Fields such as Computer Vision and Natural Language Processing have a high applicability of Machine Learning algorithms. With large amounts of complex data readily available, there are two prominent approaches to handling data complexity using Machine Learning. First, dimensionality reduction methods such as Principal Component Analysis (PCA) or Laplacian Embeddings (LE) can minimize the number of features needed to accurately represent data. This approach is often effective but has two main drawbacks. First, the input to the dimensionality reduction method is a summary of all the components that make up the data and some valuable information may be lost. Second, dimensionality reduction methods are often sensitive to outliers. The second approach to dealing with complex data is Multi-Instance Learning (MIL). MIL introduces a new paradigm for data representation by viewing data as a grouping, called a bag, of instances. Each instance is modeled the same way the whole data would be represented but now the datum is represented as a bag of instances. Multi-Instance representation can be effective since they focus on modeling all the pieces that make up the whole datum. However, in order to use this representation in Machine Learning applications we must use MIL algorithms and cannot directly use traditional Machine Learning algorithms. In this work, we propose a method to tackle the issues that may arise in dimensionality reduction methods and MIL methods. We do this by learning a reduced-dimension, integrated, outlier resilient single instance representation for our data. We first propose a new dimensionality reduction method of p-Order Laplacian Embeddings (pOLE) that is less sensitive to outliers than traditional LE. We then use this method to learn a projection from the instances of each bag in a Multi-Instance representation of data. This projection, combined with the Single-Instance representation of the same data can produce a reduced-dimension, integrated, outlier resilient Single-Instance representation | |
dc.format.medium | born digital | |
dc.format.medium | masters theses | |
dc.language | English | |
dc.language.iso | eng | |
dc.publisher | Colorado School of Mines. Arthur Lakes Library | |
dc.relation.ispartof | 2010-2019 - Mines Theses & Dissertations | |
dc.rights | Copyright of the original work is retained by the author. | |
dc.subject | machine learning | |
dc.subject | Laplacian embedding | |
dc.title | Converting data from multi-instance to single-instance representations using p-Order Laplacian projections | |
dc.type | Text | |
dc.contributor.committeemember | Han, Qi | |
dc.contributor.committeemember | Williams, Thomas | |
dcterms.embargo.terms | 2019-06-19 | |
dcterms.embargo.expires | 2019-06-19 | |
thesis.degree.name | Master of Science (M.S.) | |
thesis.degree.level | Masters | |
thesis.degree.discipline | Computer Science | |
thesis.degree.grantor | Colorado School of Mines | |
dc.rights.access | Embargo Expires: 06/19/2019 |