Saturday 4th of May 2024
 

A Method using Language Grid and Concept Base for Japanese-English Cross-language Information Retrieval


Pham Huy Anh and Yukawa Takashi

This paper describes query translation using language resources and a concept base method for Cross-language Information Retrieval (CLIR). In the proposed method, queries are translated by multiple machine translation systems on the Language Grid. The queries are then expanded by using a bilingual dictionary to translate compound words or word phrases. In addition, documents related to the translated query are retrieved with a TF-IDF term weighting model. The top 100 retrieved documents are re-ranked by a specificity-considered concept base with the noun phrases and compound words extracted from the query. The re-ranked results are combined with the results retrieved by the probabilistic model. For evaluation of the proposed method, we use the average precision of the non-interpolated recall and precision to compare our method with the NTCIR1 participation systems. The proposed method achieved the highest precision.

Keywords: Cross-language Information Retrieval, CLIR, Language resources, Concept base, Language Grid.

Download Full-Text


ABOUT THE AUTHORS

Pham Huy Anh
Pham Huy Anh received a B.S. in Electrical Engineering from the National Defense Academy and an M.S. in Electrical Engineering from the Nagaoka University of Technology. He is currently a student of doctor course in the Department of Electrical Engineering at the Nagaoka University of Technology.

Yukawa Takashi
Takashi Yukawa received a Master of Engineering degree from the Nagaoka University of Technology in 1987 and a Doctor of Informatics degree from Kyoto University in 2001. He has been involved in the research and development of a parallel computer for expert systems, a concept-sensitive information retrieval system and its application systems, knowledge management systems and an intelligent course management system for e-Learning. He is currently an associate professor in the Department of Electrical Engineering at the Nagaoka University of Technology.


IJCSI Published Papers Indexed By:

 

 

 

 
+++
About IJCSI

IJCSI is a refereed open access international journal for scientific papers dealing in all areas of computer science research...

Learn more »
Join Us
FAQs

Read the most frequently asked questions about IJCSI.

Frequently Asked Questions (FAQs) »
Get in touch

Phone: +230 911 5482
Email: info@ijcsi.org

More contact details »