To install click the Add extension button. That's it.

The source code for the WIKI 2 extension is being checked by specialists of the Mozilla Foundation, Google, and Apple. You could also do it yourself at any point in time.

4,5
Kelly Slayton
Congratulations on this excellent venture… what a great idea!
Alexander Grigorievskiy
I use WIKI 2 every day and almost forgot how the original Wikipedia looks like.
Live Statistics
English Articles
Improved in 24 Hours
Added in 24 Hours
Languages
Recent
Show all languages
What we do. Every page goes through several hundred of perfecting techniques; in live mode. Quite the same Wikipedia. Just better.
.
Leo
Newton
Brights
Milds

Cross-language information retrieval

From Wikipedia, the free encyclopedia


Cross-language information retrieval (CLIR) is a subfield of information retrieval dealing with retrieving information written in a language different from the language of the user's query.[1] The term "cross-language information retrieval" has many synonyms, of which the following are perhaps the most frequent: cross-lingual information retrieval, translingual information retrieval, multilingual information retrieval. The term "multilingual information retrieval" refers more generally both to technology for retrieval of multilingual collections and to technology which has been moved to handle material in one language to another. The term Multilingual Information Retrieval (MLIR) involves the study of systems that accept queries for information in various languages and return objects (text, and other media) of various languages, translated into the user's language. Cross-language information retrieval refers more specifically to the use case where users formulate their information need in one language and the system retrieves relevant documents in another. To do so, most CLIR systems use various translation techniques. CLIR techniques can be classified into different categories based on different translation resources:[2]

  • Dictionary-based CLIR techniques
  • Parallel corpora based CLIR techniques
  • Comparable corpora based CLIR techniques
  • Machine translator based CLIR techniques

CLIR systems have improved so much that the most accurate multi-lingual and cross-lingual adhoc information retrieval systems today are nearly as effective as monolingual systems.[3] Other related information access tasks, such as media monitoring, information filtering and routing, sentiment analysis, and information extraction require more sophisticated models and typically more processing and analysis of the information items of interest. Much of that processing needs to be aware of the specifics of the target languages it is deployed in.

Mostly, the various mechanisms of variation in human language pose coverage challenges for information retrieval systems: texts in a collection may treat a topic of interest but use terms or expressions which do not match the expression of information need given by the user. This can be true even in a mono-lingual case, but this is especially true in cross-lingual information retrieval, where users may know the target language only to some extent. The benefits of CLIR technology for users with poor to moderate competence in the target language has been found to be greater than for those who are fluent.[4] Specific technologies in place for CLIR services include morphological analysis to handle inflection, decompounding or compound splitting to handle compound terms, and translations mechanisms to translate a query from one language to another.

The first workshop on CLIR was held in Zürich during the SIGIR-96 conference.[5] Workshops have been held yearly since 2000 at the meetings of the Cross Language Evaluation Forum (CLEF). Researchers also convene at the annual Text Retrieval Conference (TREC) to discuss their findings regarding different systems and methods of information retrieval, and the conference has served as a point of reference for the CLIR subfield.[6] Early CLIR experiments were conducted at TREC-6, held at the National Institute of Standards and Technology (NIST) on November 19–21, 1997.[7]

Google Search had a cross-language search feature that was removed in 2013.[8]

YouTube Encyclopedic

  • 1/3
    Views:
    5 345
    12 883
    937
  • Lecture 10: Introduction to Information Retrieval
  • How Are Words Connected in our Minds? Priming
  • Signature files in Information Retrieval

Transcription

See also

  • EXCLAIM (EXtensible Cross-Linguistic Automatic Information Machine)
  • CLEF (Conference and Labs of the Evaluation Forum, formerly known as Cross-Language Evaluation Forum)

References

  1. ^ Wang, Jianqiang, and Douglas W. Oard. "Matching meaning for cross-language information retrieval." Information Processing & Management48.4 (2012): 631-53.
  2. ^ Tsai, Peishan. "An Introduction to Cross-Language Information Retrieval Approaches". www.mikeandpeishan.com. Archived from the original on 2022-11-04. Retrieved 2022-11-04.
  3. ^ Oard, Douglas. "Multilingual Information Access." Understanding Information Retrieval Systems(2011): 373-80. Web.
  4. ^ Airio, Eija (2008). "Who benefits from CLIR in web retrieval?". Journal of Documentation. 64 (5): 760–778. doi:10.1108/00220410810899754.
  5. ^ The proceedings of this workshop can be found in the book Cross-Language Information Retrieval (Grefenstette, ed; Kluwer, 1998) ISBN 0-7923-8122-X.
  6. ^ Olvera-Lobo, María-Dolores. "Cross-Language Information Retrieval on the Web." Handbook of Research on Social Dimensions of Semantic Technologies and Web Services(n.d.): 704-19. Web.
  7. ^ Vorhees, Ellen M.; Harman, Donna (1999). "Overview of the Sixth Text REtrieval Conference (TREC-6)". Information Processing and Management.
  8. ^ "Google Drops "Translated Foreign Pages" Search Option Due To Lack Of Use". 20 May 2013.

External links


This page was last edited on 22 April 2024, at 09:20
Basis of this page is in Wikipedia. Text is available under the CC BY-SA 3.0 Unported License. Non-text media are available under their specified licenses. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc. WIKI 2 is an independent company and has no affiliation with Wikimedia Foundation.