1 dataset found

Tags: lm

  • The ClueWeb09 Dataset

    The ClueWeb09 dataset was created to support research on information retrieval and related human language technologies. It consists of about 1 billion web pages in ten languages...

You can also access this registry using the API (see API Docs).