4 dataset trovati

Tag: information-retrieval

  • WikiWord Thesaurus Data

    About Overview: The WikiWord-Thesaurus is a multilingual Thesaurus derived from Wikipedia by extracting lexical and semantic information. It was originally developed for a...
  • WikiWord

    About Overview: WikiWord is a system for building a multilingual Thesaurus by extracting lexical and semantic information from Wikipedia. It was originally developed for a...
  • Reuters-21578

    A set of documents from Reuters' 1986 newswire which have been classified. This dataset is appropriate for testing natural language processing and information retrieval...
  • RCV1-v2/LYRL2004

    This is a publicly available, tokenized version of the Reuters RCV1 corpus by David D Lewis et al. The creator requests attribution.
E' possibile inoltre accedere al registro usando il API (vedi Documentazione API).