Stage-oe-small.jpg

Inproceedings2011: Unterschied zwischen den Versionen

Aus Aifbportal
Wechseln zu:Navigation, Suche
K (Added from ontology)
K (Added from ontology)
Zeile 2: Zeile 2:
 
|ErsterAutorNachname=Cimiano
 
|ErsterAutorNachname=Cimiano
 
|ErsterAutorVorname=Philipp
 
|ErsterAutorVorname=Philipp
 +
}}
 +
{{Publikation Author
 +
|Rank=5
 +
|Author=Steffen  Staab
 
}}
 
}}
 
{{Publikation Author
 
{{Publikation Author
 
|Rank=3
 
|Rank=3
 
|Author=Sergej Sizov
 
|Author=Sergej Sizov
}}
 
{{Publikation Author
 
|Rank=2
 
|Author=Antje Schultz
 
 
}}
 
}}
 
{{Publikation Author
 
{{Publikation Author
Zeile 16: Zeile 16:
 
}}
 
}}
 
{{Publikation Author
 
{{Publikation Author
|Rank=5
+
|Rank=2
|Author=Steffen  Staab
+
|Author=Antje Schultz
 
}}
 
}}
 
{{Inproceedings
 
{{Inproceedings
Zeile 39: Zeile 39:
 
}}
 
}}
 
{{Forschungsgebiet Auswahl
 
{{Forschungsgebiet Auswahl
|Forschungsgebiet=Informationssysteme
+
|Forschungsgebiet=Information Retrieval
 
}}
 
}}
 
{{Forschungsgebiet Auswahl
 
{{Forschungsgebiet Auswahl
|Forschungsgebiet=Text Mining
+
|Forschungsgebiet=Informationsextraktion
 
}}
 
}}
 
{{Forschungsgebiet Auswahl
 
{{Forschungsgebiet Auswahl
|Forschungsgebiet=Information Retrieval
+
|Forschungsgebiet=Natürliche Sprachverarbeitung
 
}}
 
}}
 
{{Forschungsgebiet Auswahl
 
{{Forschungsgebiet Auswahl
|Forschungsgebiet=Natürliche Sprachverarbeitung
+
|Forschungsgebiet=Digitale Bibliotheken
 
}}
 
}}
 
{{Forschungsgebiet Auswahl
 
{{Forschungsgebiet Auswahl
|Forschungsgebiet=Informationsextraktion
+
|Forschungsgebiet=Informationssysteme
 
}}
 
}}
 
{{Forschungsgebiet Auswahl
 
{{Forschungsgebiet Auswahl
|Forschungsgebiet=Digitale Bibliotheken
+
|Forschungsgebiet=Text Mining
 
}}
 
}}

Version vom 11. September 2009, 01:26 Uhr


Explicit vs. Latent Concept Models for Cross-Language Information Retrieval


Explicit vs. Latent Concept Models for Cross-Language Information Retrieval



Published: 2009 Juli

Buchtitel: Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI)

Referierte Veröffentlichung

BibTeX

Kurzfassung
The field of information retrieval and text manipulation (classification, clustering) still strives for models allowing semantic information to be folded in to improve performance with respect to standard bag-of-word based models. Many approaches aim at a concept-based retrieval, but differ in the nature of the concepts, which range from linguistic concepts as defined in lexical resources such as WordNet, latent topics derived from the data itself - as in Latent Semantic Indexing (LSI) or (Latent Dirichlet Allocation (LDA) - to Wikipedia articles as proxies for concepts, as in the recently proposed Explicit Semantic Analysis (ESA) model. A crucial question which has not been answered so far is whether models based on explicitly given concepts (as in the ESA model for instance) perform inherently better than retrieval models based on "latent" concepts (as in LSI and/or LDA). In this paper we investigate this question closer in the context of a cross-language setting, which inherently requires concept-based retrieval bridging between different languages. In particular, we compare the recently proposed ESA model with two latent models (LSI and LDA) showing that the former is clearly superior to the both. From a general perspective, our results contribute to clarifying the role of explicit vs. implicitly derived or latent concepts in (crosslanguage) information retrieval research.

Download: Media:2009_2011_Cimiano_Explicit_vs._La_1.pdf

Projekt

Multipla



Forschungsgebiet

Information Retrieval, Text Mining, Informationssysteme, Informationsextraktion, Natürliche Sprachverarbeitung, Digitale Bibliotheken