Stikkord: language

Det er 32 datasett merket med language:

  • Language Commons
    • 7 views
    • None Har ikke åpen lisens
  • Apertium
    • 87 views
    • None Har ikke åpen lisens
    Description "Apertium is a toolbox to build open-source shallow-transfer machine translation systems, especially suitable for related language pairs: it includes the engine, maintenance...
  • Dict.cc - English German Dictionary
    • 267 views
    • None Har ikke åpen lisens
    About From about page: dict.cc is not only an online dictionary. It's an attempt to create a platform where users from all over the world can share their knowledge in the field of...
  • Polygotte
    • 37 views
    • [Åpne data]
  • About From the creators: The purpose of this project is to create a free, open simple dictionary for students to use. The words in the dictionary will reviewed for quality and...
  • Wiktionary - EN
    • 121 views
    • [Åpne data]
    About Welcome to the English-language Wiktionary, a collaborative project to produce a free, multilingual dictionary with definitions, etymologies, pronunciations, sample quotations,...
  • TalkBank
    • 35 views
    • None Har ikke åpen lisens
    About About TalkBank: The goal of TalkBank is to foster fundamental research in the study of human and animal communication. It will construct sample databases within each of the...
  • About From website: As of November 2007, the European Commission's Directorate-General for Translation (DGT) made publicly accessible its multilingual Translation Memory for the Acquis...
  • MOCHA-TIMIT
    • 77 views
    • None Har ikke åpen lisens
    About Authors: Alan Wrench, Queen Margaret University College. Funded by: Engineering and Physical Sciences Research Council. When created: November 1999....
  • The Speech Accent Archive
    • 27 views
    • None Har ikke åpen lisens
    From website: The speech accent archive uniformly presents a large set of speech samples from a variety of language backgrounds. Native and non-native speakers of English read the same...
  • About ISO 639-3 is a list of three letter codes for languages: ISO 639-3 attempts to provide as complete an enumeration of languages as possible, including living, extinct, ancient, and...
  • Wordnet
    • 69 views
    • [Åpne data]
    About From website: WordNet® is a large lexical database of English, developed under the direction of George A. Miller. Nouns, verbs, adjectives and adverbs are grouped into sets of...
  • Perseus Digital Library
    • 41 views
    • None Har ikke åpen lisens
    Description Started in 1987 focusing on classics but has now expanded to other areas. Though containing a large amount of material site itself is focused on being a digital library (i.e....
  • As of November 2007, the European Commission's Directorate-General for Translation (DGT) made publicly accessible its multilingual Translation Memory for the Acquis Communautaire (the...
  • About From website: Welcome to fsi-language-courses.com, the home for language courses developed by the Foreign Service Institute. These courses were developed by the United States...
  • About From the about page: The Rosetta Project is a global collaboration of language specialists and native speakers working to build a publicly accessible digital library of human...
  • The IBL Corpus
    • 15 views
    • None Har ikke åpen lisens
    About The IBL Corpus was collected by the University of Plymouth and the University of Edinburgh as part of the EPSRC funded project IBL, Instruction-based Learning for Mobile Robots...
  • About From website: WordNet is a lexical database for English that has been widely adopted in artificial intelligence and computational linguistics for a variety of practical...
  • FrameNet
    • 154 views
    • None Har ikke åpen lisens
    About From website: The Berkeley FrameNet project is creating an on-line lexical resource for English, based on frame semantics and supported by corpus evidence. The aim is to document...
  • This is a set of ISO codes including those for country and currency collected together into a useful package by the Debian project. From the package page: This package provides the...
  • POS Tagger for Romanian Language
    • 206 views
    • None Har ikke åpen lisens
    We have developed a hidden Markov model-based part-of-speech tagger for the Romanian language. Our interactive web tool is located at...
  • Description Lingvoj means languages in Esperanto. From the frontpage of http://www.lingvoj.org/: Since the launch of lingvoj.org in 2007, the linked data cloud has grown at a steady...
  • About Overview: The WikiWord-Thesaurus is a multilingual Thesaurus derived from Wikipedia by extracting lexical and semantic information. It was originally developed for a diploma thesis...
  • WikiWord
    • 29 views
    • None Har ikke åpen lisens
    About Overview: WikiWord is a system for building a multilingual Thesaurus by extracting lexical and semantic information from Wikipedia. It was originally developed for a diploma thesis...
  • LanguageTool
    • 19 views
    • None Har ikke åpen lisens
    An Open Source language checker for English, German, Polish, Dutch, and other languages. This is a rule-based language checker that will find errors for which a rule is defined in its XML...
  • Fred Jehle, formerly a professor at Indiana University-Purdue University Fort Wayne, published approximately 600 verbs, fully conjugated in all moods and tenses, on his website in 1998....
  • About Data exposed: Linguistic Data Size of dump and data set: ~40MB Openness Download dump: CC-BY-SA 3.0 license The web service additionally provides some parts that are not fully...
  • RDF conversion of Princeton's package:wordnet, version 3.0. With many links to package:w3c-wordnet, package:lexvo and the Dutch package:cornetto.
  • About From website: WALS is a large database of structural (phonological, grammatical, lexical) properties of languages gathered from descriptive materials (such as reference grammars)...
  • Description Overview from home page: The Europarl parallel corpus is extracted from the proceedings of the European Parliament. It includes versions in 11 European languages: Romanic...
  • Cornetto
    • 584 views
    • None Har ikke åpen lisens
    Dutch lexical database, similar to WordNet but with more semantic relations. Links to package:vu-wordnet and package:w3c-wordnet. When this dataset is used for research purposes,...