5 найдено массивов данных

Теги: web

  • STAC

    A dataset to describe security mechanisms: security protocols, security tools, cryptographic concepts (encryption algorithms, hash functions, key management, etc.) in various...
  • Identi.ca Популярное

    About Identi.ca is a microblogging service. Users post short (140 character) notices which are broadcast to their friends and fans using the Web, RSS, or instant messages....
  • dotnetdotcom

    About We invite you to help us share the content of internet by downloading the first part of our index. It has roughly 600,000 pages and is shared in an easy to parse text...
  • A corpus of web crawl data composed of 5 billion web pages.

    A corpus of web crawl data composed of 5 billion web pages. This data set is freely available on Amazon S3 at s3://aws-publicdatasets/common-crawl/crawl-002/ and formatted in...
  • The ClueWeb09 Dataset

    The ClueWeb09 dataset was created to support research on information retrieval and related human language technologies. It consists of about 1 billion web pages in ten languages...
Вы можете получить доступ к этому реестру через API (see Документация API).