Difference between revisions of "Resources for Norwegian"
m (→Copyleft)
(13 intermediate revisions by the same user not shown)
Line 1: | Line 1: | ||
==Corpora== | ==Corpora== | ||
− | === | + | ===Free software=== |
− | + | ===Proprietary=== | |
− | |||
− | === | ||
* [http://corpora.informatik.uni-leipzig.de/ Norwegian plain text and Co-occurrences at LCC] ("the corpora may be used for scientific purposes only and not passed on to third parties") | * [http://corpora.informatik.uni-leipzig.de/ Norwegian plain text and Co-occurrences at LCC] ("the corpora may be used for scientific purposes only and not passed on to third parties") | ||
Line 14: | Line 12: | ||
===Free software=== | ===Free software=== | ||
− | * [http://apertium. | + | * [http://www.apertium.org Apertium] Norwegian Nynorsk<->Norwegian Bokmål, GPL v2 |
+ | ** [http://wiki.apertium.org/wiki/Apertium-nn-nb wiki] with installation information etc. | ||
===Proprietary=== | ===Proprietary=== | ||
+ | |||
+ | ==Lexical resources== | ||
+ | ===Free software=== | ||
+ | * [http://svn.emmtee.net/tags/topp/parc/pargram/norwegian/bokmal/bokmal-nkllex.lfg Bokmål LFG lexicon] with POS and count/mass, GPL | ||
+ | * [http://www.edd.uio.no/prosjekt/ordbanken/ Norsk ordbank], full form dictionaries for Nynorsk (106,789 lemmata) and Bokmål (142,899 lemmata), GPL | ||
+ | ** [http://savannah.nongnu.org/projects/ordbanken/ alternative download with cli lookup interface] | ||
+ | * [http://www.nb.no/spraakbanken/tilgjengelege-ressursar/leksikalske-databasar SCARRIE, Bokmål full form dictionary], XML, about 75,000 lemmata, CC-BY unported | ||
+ | |||
+ | ===Unknown license=== | ||
+ | * [http://www.nb.no/spraakbanken/tilgjengelege-ressursar/leksikalske-databasar "Leksikalsk database for norsk, opphavleg produsert av NST"], lexical database with SAMPA transcriptions, meant for speech technology | ||
+ | |||
+ | ==Parsing/disambiguation== | ||
+ | ===Free software=== | ||
+ | * [http://www.hf.uio.no/tekstlab/tagger.html Oslo-Bergen-taggeren], [[Constraint Grammar]] disambiguator, GPL | ||
+ | ** [https://github.com/noklesta/The-Oslo-Bergen-Tagger source and packages on github] | ||
+ | ** [http://maximos.aksis.uib.no/Aksis-wiki/Oslo-Bergen_Tagger older alternative download site] | ||
+ | ** [http://apertium.svn.sourceforge.net/viewvc/apertium/trunk/apertium-nn-nb/ the version used in Apertium] | ||
+ | ** [https://github.com/ogrim/clj-obt Clojure bindings] | ||
+ | |||
+ | |||
+ | * [http://www.hf.ntnu.no/hf/isk/Ansatte/petter.haugereid/norsyg.html Norsyg], [[HPSG]] grammar for Norwegian bokmål, LGPL. Implemented in [[LKB]], works with the full ''Norsk ordbank'' lexicon. | ||
+ | |||
+ | [[Category:Resources by language|Norwegian]] |
Revision as of 08:10, 23 February 2012
Corpora
Free software
Proprietary
- Norwegian plain text and Co-occurrences at LCC ("the corpora may be used for scientific purposes only and not passed on to third parties")
Machine translation systems
Free software
Proprietary
Lexical resources
Free software
- Bokmål LFG lexicon with POS and count/mass, GPL
- Norsk ordbank, full form dictionaries for Nynorsk (106,789 lemmata) and Bokmål (142,899 lemmata), GPL
- SCARRIE, Bokmål full form dictionary, XML, about 75,000 lemmata, CC-BY unported
Unknown license
- "Leksikalsk database for norsk, opphavleg produsert av NST", lexical database with SAMPA transcriptions, meant for speech technology
Parsing/disambiguation
Free software
- Oslo-Bergen-taggeren, Constraint Grammar disambiguator, GPL