Mark Kröll and Roman Kern from Know-Center have designed an Open Information Extraction system for the German language. The associated paper has appeared in the high-quality electronic journal J.UCS.

Open Information Extraction (OIE) makes it possible to extract relations from text without the need for domain-specific training data. To date, most research on OIE has focused on the English language, and little or no research has been conducted on other languages, including German.

To address this gap, we developed GerIE, an OIE system for the German language. We surveyed the literature on OIE to identify concepts that may apply to German. Our system takes the output of a German dependency parser and applies a number of handcrafted rules to extract the propositions. To evaluate the system, we created two dedicated datasets: one derived from news articles and the other from encyclopedia texts. Our system achieves F-measures of up to 0.89 for correctly preprocessed sentences.
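
To illustrate the general idea of rule-based extraction over a dependency parse, the following minimal sketch applies a single handcrafted rule to a hand-annotated sentence. It is not GerIE's actual rule set: the `Token` class, the `extract_propositions` function, the dependency labels (`sb` for subject, `oa` for accusative object, `ROOT` for the main verb), and the example sentence are all assumptions made for illustration, and a real system would obtain the parse from a German dependency parser.

```python
from dataclasses import dataclass


@dataclass
class Token:
    text: str
    head: int  # index of the governing token; the root points to itself
    dep: str   # dependency label (assumed labels: "sb", "oa", "ROOT")


def extract_propositions(tokens):
    """Apply one illustrative rule: a root verb with a subject and an
    accusative object yields a (subject, verb, object) proposition."""
    propositions = []
    for i, tok in enumerate(tokens):
        if tok.dep != "ROOT":
            continue
        # Collect the direct dependents of the root verb by label.
        subjects = [t for t in tokens if t.head == i and t.dep == "sb"]
        objects = [t for t in tokens if t.head == i and t.dep == "oa"]
        for subj in subjects:
            for obj in objects:
                propositions.append((subj.text, tok.text, obj.text))
    return propositions


# Hand-annotated parse of "Anna liest Bücher" ("Anna reads books").
sentence = [
    Token("Anna", 1, "sb"),
    Token("liest", 1, "ROOT"),
    Token("Bücher", 1, "oa"),
]

print(extract_propositions(sentence))
```

A sentence whose root verb lacks a subject or an object yields no proposition under this single rule; covering further constructions would require additional rules.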

Impact Factor: 0.566 / Q2 Journal

