Difference between revisions of "Resources for Norwegian"
Jump to navigation
Jump to search
Line 22: | Line 22: | ||
* [http://www.edd.uio.no/prosjekt/ordbanken/ Norsk ordbank], full form dictionaries for Nynorsk (106,789 lemmata) and Bokmål (142,899 lemmata), GPL | * [http://www.edd.uio.no/prosjekt/ordbanken/ Norsk ordbank], full form dictionaries for Nynorsk (106,789 lemmata) and Bokmål (142,899 lemmata), GPL | ||
** [http://savannah.nongnu.org/projects/ordbanken/ alternative download with cli lookup interface] | ** [http://savannah.nongnu.org/projects/ordbanken/ alternative download with cli lookup interface] | ||
+ | * [http://www.nb.no/spraakbanken/tilgjengelege-ressursar/leksikalske-databasar SCARRIE, Bokmål full form dictionary), XML, about 75,000 lemmata, CC-BY unported | ||
==Parsing/disambiguation== | ==Parsing/disambiguation== |
Revision as of 07:06, 23 February 2012
Corpora
Free software
Proprietary
- Norwegian plain text and Co-occurrences at LCC ("the corpora may be used for scientific purposes only and not passed on to third parties")
Timeline Analysis
Machine translation systems
Free software
Proprietary
Lexical resources
Free software
- Bokmål LFG lexicon with POS and count/mass, GPL
- Norsk ordbank, full form dictionaries for Nynorsk (106,789 lemmata) and Bokmål (142,899 lemmata), GPL
- [http://www.nb.no/spraakbanken/tilgjengelege-ressursar/leksikalske-databasar SCARRIE, Bokmål full form dictionary), XML, about 75,000 lemmata, CC-BY unported
Parsing/disambiguation
Free software
- Oslo-Bergen-taggeren, Constraint Grammar disambiguator, GPL