Etiqueta: language

Hi ha 32 conjunts de dades etiquetats amb language:

  • Language Commons
    • 7 views
    • None No té una llicència oberta
  • Apertium
    • 87 views
    • None No té una llicència oberta
    Description "Apertium is a toolbox to build open-source shallow-transfer machine translation systems, especially suitable for related language pairs: it includes the engine, maintenance...
  • Dict.cc - English German Dictionary
    • 267 views
    • None No té una llicència oberta
    About From about page: dict.cc is not only an online dictionary. It's an attempt to create a platform where users from all over the world can share their knowledge in the field of...
  • Polygotte
    • 37 views
    • [Dades Obertes]
  • About From the creators: The purpose of this project is to create a free, open simple dictionary for students to use. The words in the dictionary will reviewed for quality and...
  • Wiktionary - EN
    • 121 views
    • [Dades Obertes]
    About Welcome to the English-language Wiktionary, a collaborative project to produce a free, multilingual dictionary with definitions, etymologies, pronunciations, sample quotations,...
  • TalkBank
    • 35 views
    • None No té una llicència oberta
    About About TalkBank: The goal of TalkBank is to foster fundamental research in the study of human and animal communication. It will construct sample databases within each of the...
  • About From website: As of November 2007, the European Commission's Directorate-General for Translation (DGT) made publicly accessible its multilingual Translation Memory for the Acquis...
  • MOCHA-TIMIT
    • 77 views
    • None No té una llicència oberta
    About Authors: Alan Wrench, Queen Margaret University College. Funded by: Engineering and Physical Sciences Research Council. When created: November 1999....
  • The Speech Accent Archive
    • 27 views
    • None No té una llicència oberta
    From website: The speech accent archive uniformly presents a large set of speech samples from a variety of language backgrounds. Native and non-native speakers of English read the same...
  • About ISO 639-3 is a list of three letter codes for languages: ISO 639-3 attempts to provide as complete an enumeration of languages as possible, including living, extinct, ancient, and...
  • Wordnet
    • 69 views
    • [Dades Obertes]
    About From website: WordNet® is a large lexical database of English, developed under the direction of George A. Miller. Nouns, verbs, adjectives and adverbs are grouped into sets of...
  • Perseus Digital Library
    • 41 views
    • None No té una llicència oberta
    Description Started in 1987 focusing on classics but has now expanded to other areas. Though containing a large amount of material site itself is focused on being a digital library (i.e....
  • As of November 2007, the European Commission's Directorate-General for Translation (DGT) made publicly accessible its multilingual Translation Memory for the Acquis Communautaire (the...
  • About From website: Welcome to fsi-language-courses.com, the home for language courses developed by the Foreign Service Institute. These courses were developed by the United States...
  • About From the about page: The Rosetta Project is a global collaboration of language specialists and native speakers working to build a publicly accessible digital library of human...
  • The IBL Corpus
    • 15 views
    • None No té una llicència oberta
    About The IBL Corpus was collected by the University of Plymouth and the University of Edinburgh as part of the EPSRC funded project IBL, Instruction-based Learning for Mobile Robots...
  • About From website: WordNet is a lexical database for English that has been widely adopted in artificial intelligence and computational linguistics for a variety of practical...
  • FrameNet
    • 154 views
    • None No té una llicència oberta
    About From website: The Berkeley FrameNet project is creating an on-line lexical resource for English, based on frame semantics and supported by corpus evidence. The aim is to document...
  • This is a set of ISO codes including those for country and currency collected together into a useful package by the Debian project. From the package page: This package provides the...
  • POS Tagger for Romanian Language
    • 206 views
    • None No té una llicència oberta
    We have developed a hidden Markov model-based part-of-speech tagger for the Romanian language. Our interactive web tool is located at...
  • Description Lingvoj means languages in Esperanto. From the frontpage of http://www.lingvoj.org/: Since the launch of lingvoj.org in 2007, the linked data cloud has grown at a steady...
  • CoGrOO - a Portuguese Grammar Checker to OpenOffice.org
    • 9 views
    • None No té una llicència oberta
  • About Overview: The WikiWord-Thesaurus is a multilingual Thesaurus derived from Wikipedia by extracting lexical and semantic information. It was originally developed for a diploma thesis...
  • WikiWord
    • 29 views
    • None No té una llicència oberta
    About Overview: WikiWord is a system for building a multilingual Thesaurus by extracting lexical and semantic information from Wikipedia. It was originally developed for a diploma thesis...
  • LanguageTool
    • 19 views
    • None No té una llicència oberta
    An Open Source language checker for English, German, Polish, Dutch, and other languages. This is a rule-based language checker that will find errors for which a rule is defined in its XML...
  • Fred Jehle, formerly a professor at Indiana University-Purdue University Fort Wayne, published approximately 600 verbs, fully conjugated in all moods and tenses, on his website in 1998....
  • About Data exposed: Linguistic Data Size of dump and data set: ~40MB Openness Download dump: CC-BY-SA 3.0 license The web service additionally provides some parts that are not fully...
  • RDF conversion of Princeton's package:wordnet, version 3.0. With many links to package:w3c-wordnet, package:lexvo and the Dutch package:cornetto.
  • About From website: WALS is a large database of structural (phonological, grammatical, lexical) properties of languages gathered from descriptive materials (such as reference grammars)...
  • Description Overview from home page: The Europarl parallel corpus is extracted from the proceedings of the European Parliament. It includes versions in 11 European languages: Romanic...
  • Cornetto
    • 584 views
    • None No té una llicència oberta
    Dutch lexical database, similar to WordNet but with more semantic relations. Links to package:vu-wordnet and package:w3c-wordnet. When this dataset is used for research purposes,...