Large-Scale Pattern-Based Information Extraction from the World Wide Web/en
Large-Scale Pattern-Based Information Extraction from the World Wide Web
Type of Event:
Suppose you want to stay up-to-date with all relevant facts about a product on the market or quickly generate a list of interesting events at your next travel destination? This data is probably nowhere covered broader and up-to-date than on the Web. Wouldn’t it be convenient to simply aggregate the required facts from the Web?
Extracting information from text is the task of obtaining structured, machine-processable facts from information that is mentioned in an unstructured manner. It thus allows systems to automatically aggregate information for further analysis, efficient retrieval, automatic validation, or appropriate visualization. Information Extraction systems require a model that describes how to identify relevant target information in texts. These models need to be adapted to the exact nature of the target information and to the nature of the textual input, which is typically accomplished by means of Machine Learning techniques that generate such models based on examples.
The talk presents studies on using textual patterns for extracting information from the World Wide Web. We explore both fundamental design choices and practical applications.
The talk will be given in German language.
Start: 2009-11-18 at 3:45 pm
End: 2009-11-18 at 4:45 pm
Building: 11.40, Room: 231
iCal event: (iCal)
Host: Research group Web Science