A Method using Language Grid and Concept Base for Japanese-English Cross-language Information Retrieval
This paper describes query translation using language resources and a concept base method for Cross-language Information Retrieval (CLIR). In the proposed method, queries are translated by multiple machine translation systems on the Language Grid. The queries are then expanded by using a bilingual dictionary to translate compound words or word phrases. In addition, documents related to the translated query are retrieved with a TF-IDF term weighting model. The top 100 retrieved documents are re-ranked by a specificity-considered concept base with the noun phrases and compound words extracted from the query. The re-ranked results are combined with the results retrieved by the probabilistic model. For evaluation of the proposed method, we use the average precision of the non-interpolated recall and precision to compare our method with the NTCIR1 participation systems. The proposed method achieved the highest precision.
Keywords: Cross-language Information Retrieval, CLIR, Language resources, Concept base, Language Grid.
Download Full-Text
ABOUT THE AUTHORS
Pham Huy Anh
Pham Huy Anh received a B.S. in Electrical Engineering from the National Defense Academy and an M.S. in Electrical Engineering from the Nagaoka University of Technology. He is currently a student of doctor course in the Department of Electrical Engineering at the Nagaoka University of Technology.
Yukawa Takashi
Takashi Yukawa received a Master of Engineering degree from the Nagaoka University of Technology in 1987 and a Doctor of Informatics degree from Kyoto University in 2001. He has been involved in the research and development of a parallel computer for expert systems, a concept-sensitive information retrieval system and its application systems, knowledge management systems and an intelligent course management system for e-Learning. He is currently an associate professor in the Department of Electrical Engineering at the Nagaoka University of Technology.
Pham Huy Anh
Pham Huy Anh received a B.S. in Electrical Engineering from the National Defense Academy and an M.S. in Electrical Engineering from the Nagaoka University of Technology. He is currently a student of doctor course in the Department of Electrical Engineering at the Nagaoka University of Technology.
Yukawa Takashi
Takashi Yukawa received a Master of Engineering degree from the Nagaoka University of Technology in 1987 and a Doctor of Informatics degree from Kyoto University in 2001. He has been involved in the research and development of a parallel computer for expert systems, a concept-sensitive information retrieval system and its application systems, knowledge management systems and an intelligent course management system for e-Learning. He is currently an associate professor in the Department of Electrical Engineering at the Nagaoka University of Technology.