Date of Award
Doctor of Philosophy (PhD)
School of Computing
Dr. James Z. Wang, Committee Chair
Dr. Pradip K Srimani
Dr. Jim Martin
Dr. Feng Luo
Vast amounts of text data pervade our daily lives. This unstructured text is a huge source of information. Efficient organization and analysis of this overwhelming volume of text can filter out irrelevant and redundant information and uncover valuable knowledge, thereby significantly reducing human effort, facilitating knowledge discovery, and enhancing cognitive abilities. Semantic similarity analysis among text objects is one of the fundamental problems in text mining, underlying tasks such as document classification and clustering, recommendation, query expansion, information retrieval, relevance feedback, and word sense disambiguation. While a combination of common sense and domain knowledge lets a person quickly determine whether two objects are similar, computers understand very little of human thinking. Knowledge resources such as ontologies can capture much of the semantics of text objects, enabling numeric representation of both domain knowledge and context information. In this dissertation, we develop a series of techniques to measure the semantic similarity of objects in multiple domains. By utilizing structured knowledge that has already been established, we extract domain knowledge from existing lexical resources and incorporate it into specific applications within different domains. Specifically, in the biology domain, we investigate the semantic similarities between gene products using the Gene Ontology. In the text domain, we propose a hybrid representation of text objects (words and documents) based on WordNet, which exploits both context and ontology information to extract meaningful information from unstructured text and measure the semantic similarity of text documents.
Song, Xuebo, "Ontology-based Domain-specific Semantic Similarity Analysis and Applications" (2018). All Dissertations. 2105.