NetON: a new tool for discovering the semantic potential of biomedical data in UMLS Semantic Network

2010
Özdemir, Birsen Gülden
The Unified Medical Language System Semantic Network (UMLS SN), an upper-level abstraction of the biomedical domain, has a complex structure with many relationships, which makes it difficult for humans to navigate. Therefore, although the SN is a valuable source for modeling the contents of the biomedical domain, its usage is limited. NetON was designed and built to automatically transform the UMLS SN into OWL sublanguages in order to support semantic operations between biomedical systems. NetON builds on advances in the Semantic Web, a candidate technology for sustaining knowledge-intensive tasks. Web Ontology Language (OWL) sublanguage rules are used to represent the information in the UMLS SN. The major contribution of NetON is the automatic transformation of the UMLS SN into OWL sublanguages, referred to as OWL Basic Species. NetON aims to transform as much information from the UMLS SN as possible. The only information that cannot be transformed into any OWL Basic Species, because the OWL standard lacks appropriate constructors, is the inheritance blocking in the UMLS SN. The UMLS SN contains implicit assertions that can be inferred by applying inference rules to explicitly specified assertions, although those assertions are not necessarily valid for all descendants. Because the inheritance blocking information is missing, the deductions of any OWL reasoner over the NetON OWL Basic Species will also include false positives. The algorithms of the second dimension of NetON take the inheritance blocking information into account while executing the inference rules. As no OWL reasoner can do this, the second dimension offers a solution for application developers.
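
The false-positive problem described in the abstract can be illustrated with a minimal sketch. This is not NetON's algorithm: the tiny hierarchy, the relation, the blocked triple, and the function names below are all assumptions chosen for illustration, and blocking is simplified to an exact-triple check. The sketch propagates one asserted relation down an is-a hierarchy twice, once ignoring blocking (roughly what a reasoner without blocking information would conclude) and once respecting it.

```python
from itertools import product

# Toy is-a hierarchy (child -> parent). Illustrative names, not data from the thesis.
PARENT = {
    "MentalProcess": "BiologicFunction",
    "Plant": "Organism",
    "Animal": "Organism",
}

# Explicitly asserted relation between two semantic types (assumed for the example).
ASSERTED = {("BiologicFunction", "process_of", "Organism")}

# Triples whose inheritance is blocked (assumed for the example).
BLOCKED = {("MentalProcess", "process_of", "Plant")}


def descendants_and_self(node):
    """Return the node together with every type below it in the hierarchy."""
    result = {node}
    changed = True
    while changed:
        changed = False
        for child, parent in PARENT.items():
            if parent in result and child not in result:
                result.add(child)
                changed = True
    return result


def infer(asserted, blocked=frozenset()):
    """Inherit each asserted relation to all descendant pairs, skipping blocked triples."""
    inferred = set()
    for subj, rel, obj in asserted:
        pairs = product(descendants_and_self(subj), descendants_and_self(obj))
        for s, o in pairs:
            triple = (s, rel, o)
            if triple not in blocked:
                inferred.add(triple)
    return inferred


naive = infer(ASSERTED)            # inheritance without blocking information
aware = infer(ASSERTED, BLOCKED)   # inheritance that respects blocking
false_positives = naive - aware
print(sorted(false_positives))
```

Under these assumptions, the difference between the two result sets is exactly the set of assertions that a blocking-unaware inference would wrongly report.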

Suggestions

Data integration over horizontally partitioned databases in service-oriented data grids
Sunercan, Hatice Kevser Sönmez; Çiçekli, Fehime Nihan; Alpdemir, Mahmut Nedim; Department of Computer Engineering (2010)
Information integration over distributed and heterogeneous resources has been challenging in many respects: coping with various kinds of heterogeneity, including data model, platform, and access interfaces; coping with various forms of data distribution and maintenance policies; scalability; performance; security and trust; reliability and resilience; legal issues; etc. It is obvious that each of these dimensions deserves a separate thread of research efforts. One particular challenge among the ones listed above tha...
Systematic component-oriented development with axiomatic design
Toğay, Cengiz; Doğru, Ali Hikmet; Department of Computer Engineering (2008)
In this research, component oriented development is supported with design guidance by extending the Axiomatic Design Theory for component orientation, and utilizing domain engineering and ontology mechanisms. Guidance is offered in the form of suggesting missing components and discovering incompatibilities among the candidate elements of software development, corresponding to different phases such as requirement analysis, design, and implementation. A mature domain concept is developed suggesting the availa...
Analysis Pattern of Sanliurfa Harran Plain in UML and its Implementation in Geodatabase
Çubuk, Ulaş; Usul, Nurünnisa; Department of Geodetic and Geographical Information Technologies (2004)
An emerging trend in GIS is the adoption of object oriented concepts for both logical and physical design phases. Extensive research has been conducted on logical design of GIS and several conceptual models have been proposed. Classical data models like the relational data model have proven to be insufficient for the conceptual modeling of spatial data. Therefore among other object oriented modeling tools, a new modeling language, Unified Modeling Language (UML) has also become a popular modeling tool in th...
A clustering method for the problem of protein subcellular localization
Bezek, Perit; Atalay, Mehmet Volkan; Department of Computer Engineering (2006)
In this study, the focus is on predicting the subcellular localization of a protein, since subcellular localization is helpful in understanding a protein’s functions. Function of a protein may be estimated from its sequence. Motifs or conserved subsequences are strong indicators of function. In a given sample set of protein sequences known to perform the same function, a certain subsequence or group of subsequences should be common; that is, occurrence (frequency) of common subsequences should be high. Our ...
Multi-resolution visualization of large scale protein networks enriched with gene ontology annotations
Yaşar, Sevgi; Can, Tolga; Department of Computer Engineering (2009)
Genome scale protein-protein interactions (PPIs) are interpreted as networks or graphs with thousands of nodes from the perspective of computer science. PPI networks represent various types of possible interactions among proteins or genes of a genome. PPI data is vital in protein function prediction since functions of the cells are performed by groups of proteins interacting with each other and main complexes of the cell are made of proteins interacting with each other. Recent increase in protein interactio...
Citation Formats
B. G. Özdemir, “NetON: a new tool for discovering the semantic potential of biomedical data in UMLS Semantic Network,” Ph.D. - Doctoral Program, Middle East Technical University, 2010.