文科
Mining Non-lattice Subgraphs for Biomedical Ontology Quality Assurance
講座題目:Mining Non-lattice Subgraphs for Biomedical Ontology Quality Assurance
講座人:崔麗聰 副教授
講座時(shí)間:16:00
講座日期:2017-6-20
地點(diǎn):長(zhǎng)安校區(qū) 圖書(shū)館一層小會(huì)議室
主辦單位:計(jì)算機(jī)科學(xué)學(xué)院
講座內(nèi)容:Biomedical ontologies andterminologies play a vital role in healthcare information management, dataintegration, and decision support. Quality issues in biomedical ontologies, if not addressed, can affect thequality of all downstream information systems relying on them as a knowledgesource. Thus Ontology quality assurance (OQA) is an indispensable part of theontology engineering cycle. However, it is labor-intensive and time-consumingto discover errors or inconsistencies by manual review of large biomedicalontologies. Effective, automated approaches for improving the quality ofbiomedical ontologies are needed to overcome the limitations of manual work. Inthis talk, I will present a non-lattice-based approach to detecting and miningpotential errors in SNOMED CT, the most comprehensive clinical health careterminology worldwide. I will introduce a scalable MapReduce pipeline forexhaustively extracting non-lattice pairs from SNOMED CT, and an effective methodfor mining lexical patterns in non-lattice subgraphs to detect errors in SNOMEDCT and suggest remediations. Since virtually all biomedical ontologies areorganized into subsumption hierarchies and have concept names, ournon-lattice–based approach can be generalized and applied to other biomedicalontologies for quality assurance purposes.