Skip to main content

Research Repository

Advanced Search

Translation techniques in cross-language information retrieval

Zhou, Dong; Truran, Mark; Brailsford, Tim; Wade, Vincent; Ashman, Helen

Authors

Dong Zhou

Mark Truran

Vincent Wade

Helen Ashman



Abstract

Cross-language information retrieval (CLIR) is an active sub-domain of information retrieval (IR). Like IR, CLIR is centered on the search for documents and for information contained within those documents. Unlike IR, CLIR must reconcile queries and documents that are written in different languages. The usual solution to this mismatch involves translating the query and/or the documents before performing the search. Translation is therefore a pivotal activity for CLIR engines. Over the last 15 years, the CLIR community has developed a wide range of techniques and models supporting free text translation. This article presents an overview of those techniques, with a special emphasis on recent developments. © 2012 ACM.

Citation

Zhou, D., Truran, M., Brailsford, T., Wade, V., & Ashman, H. (2012). Translation techniques in cross-language information retrieval. ACM Computing Surveys, 45(1), 1-44. https://doi.org/10.1145/2379776.2379777

Journal Article Type Review
Acceptance Date Jun 1, 2011
Publication Date Nov 1, 2012
Deposit Date Aug 14, 2018
Journal ACM Computing Surveys
Print ISSN 0360-0300
Electronic ISSN 1557-7341
Publisher Association for Computing Machinery (ACM)
Peer Reviewed Peer Reviewed
Volume 45
Issue 1
Pages 1-44
DOI https://doi.org/10.1145/2379776.2379777
Keywords cross-language information retrieval, query translation, document translation, bilingual dictionary, parallel corpora, machine translation, semantic model
Public URL https://uwe-repository.worktribe.com/output/942263
Publisher URL http://dx.doi.org/10.1145/2379776.2379777