
Techreport783: Difference between revisions

From Aifbportal
m (Added from ontology)
 
m (Wikipedia python library)
 
(4 intermediate revisions by 2 users not shown)
Line 1: Line 1:
{{Publikation Author
+
{{Publikation Erster Autor
|Rank=3
+
|ErsterAutorNachname=Cimiano
|Author=Steffen Staab
+
|ErsterAutorVorname=Philipp
 
}}
 
}}
 
{{Publikation Author
 
{{Publikation Author
|Rank=1
+
|Rank=2
|Author=Philipp Cimiano
+
|Author=Andreas Hotho
 
}}
 
}}
 
{{Publikation Author
 
{{Publikation Author
|Rank=2
+
|Rank=3
|Author=Andreas Hotho
+
|Author=Steffen Staab
 
}}
 
}}
 
{{Techreport
 
{{Techreport
Line 17: Line 17:
 
|Month=November
 
|Month=November
 
|Institution=Insitute AIFB, University of Karlsruhe
 
|Institution=Insitute AIFB, University of Karlsruhe
|ID Number=783
+
|Archivierungsnummer=783
 
}}
 
}}
 
{{Publikation Details
 
{{Publikation Details
Line 38: Line 38:
 
a particular smoothing technique to cope with data sparseness.
 
a particular smoothing technique to cope with data sparseness.
 
|VG Wort-Seiten=
 
|VG Wort-Seiten=
|Download=2004_783_Cimiano_Learning Concep_1.pdf, 2004_783_Cimiano_Learning Concep_2.ps
+
|Download=2004_783_Cimiano_Learning_Concep_1.pdf, 2004_783_Cimiano_Learning_Concep_1.ps
|Forschungsgebiet=Ontology Learning,
 
 
|Projekt=Dot.Kom,  
 
|Projekt=Dot.Kom,  
|Forschungsgruppe=
+
|Forschungsgruppe=Wissensmanagement
 +
}}
 +
{{Forschungsgebiet Auswahl
 +
|Forschungsgebiet=Ontology Learning
 
}}
 
}}

Current revision as of 16 October 2009, 17:22


Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis




Published: November 2004
Institution: Institute AIFB, University of Karlsruhe
Archive number: 783

BibTeX



Abstract
We present a novel approach to the automatic acquisition of taxonomies or concept hierarchies from a text corpus. The approach is based on Formal Concept Analysis (FCA), a method mainly used for the analysis of data, i.e. for investigating and processing explicitly given information. We follow Harris' distributional hypothesis and model the context of a certain term as a vector representing syntactic dependencies which are automatically acquired from the text corpus with a linguistic parser. On the basis of this context information, FCA produces a lattice that we convert into a special kind of partial order constituting a concept hierarchy. The approach is evaluated by comparing the resulting concept hierarchies with hand-crafted taxonomies for two domains: tourism and finance. We also directly compare our approach with hierarchical agglomerative clustering as well as with Bi-Section-KMeans as an instance of a divisive clustering algorithm. Furthermore, we investigate the impact of using different measures weighting the contribution of each attribute as well as of applying a particular smoothing technique to cope with data sparseness.
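To make the FCA step more concrete, the following is a minimal sketch of how formal concepts can be enumerated from a small term/attribute context. It is not the report's implementation: the toy context, the attribute names, and the helper functions `objects_having`, `formal_concepts`, and `is_subconcept` are all illustrative assumptions. In the report, the attributes would come from parsed syntactic dependencies, and the resulting lattice is further converted into a concept hierarchy, a step this sketch omits.

```python
# Hypothetical toy context: each term maps to the set of attributes
# (e.g. derived from verbs it occurs with) observed for it.
context = {
    "hotel": {"bookable"},
    "apartment": {"bookable", "rentable"},
    "car": {"bookable", "rentable", "driveable"},
    "bike": {"bookable", "rentable", "driveable", "rideable"},
    "excursion": {"bookable", "joinable"},
    "trip": {"bookable", "joinable"},
}


def objects_having(attributes, context):
    """Return all objects that carry every attribute in `attributes`."""
    return frozenset(o for o, attrs in context.items() if attributes <= attrs)


def formal_concepts(context):
    """Enumerate all formal concepts as (extent, intent) pairs.

    The intents of a context are exactly the full attribute set plus all
    intersections of object attribute sets, so we close the object intents
    under intersection and derive each extent from its intent.
    """
    all_attributes = frozenset().union(*context.values())
    intents = {all_attributes}
    for attrs in context.values():
        intents |= {frozenset(attrs) & intent for intent in intents}
    concepts = [(objects_having(intent, context), intent) for intent in intents]
    # Order from the most general concept (largest extent) to the most specific.
    return sorted(concepts, key=lambda c: (-len(c[0]), sorted(c[1])))


def is_subconcept(lower, upper):
    """(A1, B1) <= (A2, B2) in the concept lattice iff A1 is a subset of A2."""
    return lower[0] <= upper[0]


concepts = formal_concepts(context)
for extent, intent in concepts:
    print(sorted(extent), sorted(intent))
```

Each printed pair is one node of the concept lattice; `is_subconcept` gives the ordering between nodes. This brute-force closure is only practical for small contexts, and larger ones would need a dedicated FCA algorithm.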

Download: Media:2004_783_Cimiano_Learning_Concep_1.pdf, Media:2004_783_Cimiano_Learning_Concep_1.ps

Project

Dot.Kom



Research group

Wissensmanagement


Research area

Ontology Learning