Skip to main content

Research Repository

Advanced Search

Translation techniques in cross-language information retrieval

Zhou, Dong; Truran, Mark; Brailsford, Tim; Wade, Vincent; Ashman, Helen


Dong Zhou

Mark Truran

Vincent Wade

Helen Ashman


Cross-language information retrieval (CLIR) is an active sub-domain of information retrieval (IR). Like IR, CLIR is centered on the search for documents and for information contained within those documents. Unlike IR, CLIR must reconcile queries and documents that are written in different languages. The usual solution to this mismatch involves translating the query and/or the documents before performing the search. Translation is therefore a pivotal activity for CLIR engines. Over the last 15 years, the CLIR community has developed a wide range of techniques and models supporting free text translation. This article presents an overview of those techniques, with a special emphasis on recent developments. © 2012 ACM.


Zhou, D., Truran, M., Brailsford, T., Wade, V., & Ashman, H. (2012). Translation techniques in cross-language information retrieval. ACM Computing Surveys, 45(1), 1-44.

Journal Article Type Review
Acceptance Date Jun 1, 2011
Publication Date Nov 1, 2012
Deposit Date Aug 14, 2018
Journal ACM Computing Surveys
Print ISSN 0360-0300
Electronic ISSN 1557-7341
Publisher Association for Computing Machinery (ACM)
Peer Reviewed Peer Reviewed
Volume 45
Issue 1
Pages 1-44
Keywords cross-language information retrieval, query translation, document translation, bilingual dictionary, parallel corpora, machine translation, semantic model
Public URL
Publisher URL