Room 505, Cosmology Building, National Taiwan University + Cisco WebEx, Physical+Online Seminar
(實體+線上演講 台灣大學次震宇宙館505研討室+ Cisco WebEx)
Integrative Co-Embedding of Multi-View Data Sets
Haesun Park (Georgia Institute of Technology)
Abstract
An integrative co-embedding method based on constrained low rank approximation is introduced. The method achieves knowledge fusion of multi-type data and projects the various types of objects onto a common lower-dimensional space. This produces a more informed representation that maintains both in-type and across-type semantic proximity between objects. The effectiveness of the proposed method is illustrated using examples of document data clustering where we utilize co-embedding of papers, authors, key words, and patient profiling in healthcare data utilizing traditional medical records, as well as patients’ interactions via browsing and searching on healthcare web portals. One important feature of the proposed co-embedding method is its ability to compute embeddings for new, previously unobserved patient data efficiently and effectively, eliminating the need to revisit the entire data set or recomputing the embedding.
Organizer: Matthew M. Lin (NCKU)