|Title||Cross Language Concept Mining|
We present a method that, given a few words defining a concept in one language, retrieves, disambiguates, and extends the corresponding terms that define a similar concept in another specified language. This is useful for cross-lingual information retrieval and for preparing multilingual lexical resources. We automatically obtain term translations from multilingual dictionaries and disambiguate them using web counts. We then retrieve web snippets in which the translations co-occur and discover additional concept terms from these snippets. Term discovery is based on the co-appearance of similar words in symmetric patterns. We evaluate the method on a set of language pairs involving 45 languages, including combinations of very dissimilar ones such as Russian, Chinese, and Hebrew, for various concepts. We assess the quality of the retrieved sets using both human judgments and automatic comparison of the obtained categories to the corresponding English WordNet synsets.
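The term-discovery step can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not list the patterns used, so the single "X and Y" / "X or Y" pattern, the function names `symmetric_pairs` and `extend_concept`, and the rule that a pair must be seen in both orders are all assumptions made for illustration.

```python
import re

# Hypothetical pattern: "X and Y" or "X or Y". The patterns actually
# used by the method are not given in the abstract.
PATTERN = re.compile(r"\b(\w+) (?:and|or) (\w+)\b")

def symmetric_pairs(snippets):
    """Return word pairs seen in the pattern in both orders (X..Y and Y..X)."""
    seen = set()
    for text in snippets:
        for x, y in PATTERN.findall(text.lower()):
            if x != y:
                seen.add((x, y))
    # Keep a pair only if its mirror image was also observed.
    return {(x, y) for (x, y) in seen if (y, x) in seen}

def extend_concept(seeds, snippets):
    """Return new terms that co-appear symmetrically with any seed term."""
    seeds = set(seeds)
    found = set()
    for x, y in symmetric_pairs(snippets):
        if x in seeds and y not in seeds:
            found.add(y)
    return found
```

Called with the seed `["dogs"]` and snippets that contain both "dogs and cats" and "cats and dogs", `extend_concept` would return `cats`, while a word seen with the seed in only one order is ignored. The sketch assumes whitespace-separated words, so a language such as Chinese would need a segmentation step first.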
|Published in||Annual PhD students conference IAEM Lorraine - APIL 2013|
|Publisher||University of Lorraine|