Hybridization of Max.C-Value and Permutation for Refined Concept Extraction

Authors

  • R. Manimala, G. Muthu Lakshmi

Keywords:

Ontology, ontology learning, Ontology Acquisition

Abstract

Ontology is the representation of knowledge with a set of concepts and the relationships between those concepts, for a domain. It describes about the domain.It is mainly used to organize the web data for maximizing the semantic access and to extract the knowledge from different format of data. Intelligence is necessary for the creation and processing of those semantic metadata. Manual ontology construction is labour-intensive, error-prone process, rigid, expensive, time consuming and complex task. Ontology Acquisition or ontology learning includes the automatic extraction of domain’s terms, concepts and the relationships between those concepts from the corpus of text, and encoding them with an ontology language for easy information retrieval. In the automatic ontology learning, Concept extraction is the main step to improve the accuracy .This paper describes the concept extraction using Max.CVPC method. Our proposed work is the Hybridization of Maximum C-Value of Permutation of UMLS Concepts which is further filtered with the threshold to extract the dominant concepts or refined concepts. It improves the precision as well as extracts the dominant concepts to build the appropriate ontology. Here we used Permutation technique instead of n-gram to increase the sub terms and its weightage.The method is analyzed with Genia Corpus which contains 2000 MEDLINE abstracts. The results are compared with the n-gram technique which is also filtered with maximum of C-Value. We evaluate our work with the metrics precision, Recall and F-Measure.

Downloads

Download data is not yet available.

References

BorisGelfand, MarilynWulfekuhler ,WilliamF.PunchIII,”AutomatedConceptExtractionFromPlainText”,1998.

Buitelaar,P.,Cimiano,P andMagnini,B, ”Ontology learning from text: an overview. In: Ontology Learning from Text: Methods, Evaluation and Applications,Amsterdam,IOSPress, 123, 3–12,2005.

Daniel Jurafsky & James H. Martin,” Speech and Language Processing”,chapter 4, Draft of September 1, 2014.

Jin-Dong Kim , T. Ohta, Y. Tateisi and J. Tsujii , “GENIA corpus—a semantically annotated corpus for bio-textmining”, BioInformatics Vol. 19 Suppl. 1 2003, pages i180–i182, DOI: 10.1093/bioinformatics/btg1023.

Jon Patrick, Min Li,”, Journal of the American Medical Informatics Association” ·,September 2010.

K. Frantzi, S. Ananiadou, and H. Mima. Automatic recognition of multi-word terms: the c-value/nc-value method. International Journal on Digital Libraries, 3(2):115–130, 2000.

K.Karthikeyan,V. Karthikeyani,”Ontology Based Concept Hierarchy Extraction of Web Data”, Indian Journal of Science and Technology, Vol 8(6), 536–547, March 2015.

Luca Soldaini,Nazli Goharian ,”QuickUMLS: a fast, unsupervised approach for medical concept extraction “,MedIR Workshop at SIGIR 2016.

Lovins JB. Development of a stemming algorithm; 1968. p. 22–31.

Muhammad Nabeel Asim, Muhammad Wasim, Muhammad Usman Ghani Khan, Waqar Mahmood and Hafiza Mahnoor Abbasi,” A survey of ontology learning techniques and applications”,2018.

Paul M. Ramirez and Chris A. Mattmann,” ACE: Improving Search Engines via Automatic Concept Extraction”.

Philipp Cimiano, Andreas Hotho, Andreas Hotho,” Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis”, Journal of Artificial Intelligence Research 24 (2005) 305–339 ,Aug 2005.

Qing Yang ,Kai-min Cai , Yan LI ,Rui-qing Liu,” An Area Concept Extraction Algorithm Based on Association Rule”, International Conference On Computer Design And Appliations (ICCDA 2010),2010.

Raghavendra Chalapathy ,Ehsan Zare Borzeshi,Massimo Piccardi,” Bidirectional LSTM-CRF for Clinical Concept Extraction”, Proceedings of the Clinical Natural Language Processing Workshop (ClinicalNLP),December 2016.

Wanda Pratt, Ph.D, Meliha Yetisgen-Yildiz, M.S,” A Study of Biomedical Concept Identification: MetaMap vs. People”, AMIA 2003 Symposium Proceedings − Page 529.

Yuefeng Liu ,Minyong Shi, “Domain Ontology Concept Extraction Method Based on Text”,IEEE/ACIS 15th International Conference on Computer and Information Science (ICIS),2016.

Downloads

Published

30.06.2022

How to Cite

R. Manimala. (2022). Hybridization of Max.C-Value and Permutation for Refined Concept Extraction. International Journal of Intelligent Systems and Applications in Engineering, 10(2s), 343–352. Retrieved from https://www.ijisae.org/index.php/IJISAE/article/view/8007

Issue

Section

Research Article