Show simple item record

The CLASSLA-StanfordNLP model for lemmatisation of standard Slovenian 1.2

CreatorLjubešić, Nikola
Date2020-09-15T10:51:00Z
dc.date.accessioned2021-07-24T21:28:04Z
dc.date.available2021-07-24T21:28:04Z
Identifierhttp://hdl.handle.net/11356/1354
dc.identifier.urihttps://linghub.org/handle/123456789/924998
DescriptionThe model for lemmatisation of standard Slovenian was built with the CLASSLA-StanfordNLP tool (https://github.com/clarinsi/classla-stanfordnlp) by training on the ssj500k training corpus (http://hdl.handle.net/11356/1210) and using the Sloleks inflectional lexicon (http://hdl.handle.net/11356/1230). The estimated F1 of the lemma annotations is ~99.0. The difference to the previous version is that now it relies solely on XPOS annotations, and not on a combination of UPOS, FEATS (lexicon lookup) and XPOS (lemma prediction) annotations.
PublisherJožef Stefan Institute
Rightshttps://creativecommons.org/licenses/by-sa/4.0/
RightsCreative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Subjectlemmatisation
Subjectlanguage model
TitleThe CLASSLA-StanfordNLP model for lemmatisation of standard Slovenian 1.2
TypetoolService
TypeSoftware
dcterms.available2020-09-15T10:51:00Z
dcterms.bibliographicCitationhttp://hdl.handle.net/11356/1354
dcterms.creatorLjubešić, Nikola
dcterms.date2020-09-15T10:51:00Z
dcterms.descriptionThe model for lemmatisation of standard Slovenian was built with the CLASSLA-StanfordNLP tool (https://github.com/clarinsi/classla-stanfordnlp) by training on the ssj500k training corpus (http://hdl.handle.net/11356/1210) and using the Sloleks inflectional lexicon (http://hdl.handle.net/11356/1230). The estimated F1 of the lemma annotations is ~99.0. The difference to the previous version is that now it relies solely on XPOS annotations, and not on a combination of UPOS, FEATS (lexicon lookup) and XPOS (lemma prediction) annotations.
dcterms.identifierhttp://hdl.handle.net/11356/1354
dcterms.isReplacedByhttp://hdl.handle.net/11356/1412
dcterms.publisherJožef Stefan Institute
dcterms.replaceshttp://hdl.handle.net/11356/1286
dcterms.rightshttps://creativecommons.org/licenses/by-sa/4.0/
dcterms.rightsCreative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dcterms.subjectlemmatisation
dcterms.subjectlanguage model
dcterms.titleThe CLASSLA-StanfordNLP model for lemmatisation of standard Slovenian 1.2
dcterms.typetoolService
dcterms.typeSoftware
odrl.Policyhttp://purl.org/net/rdflicense/cc-by-sa4.0


Check resource access

Authorized
Reason

Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

  • OLAC
    Main data from the OLAC dataset

Show simple item record


Copyright  © 2020 All Rights Reserved by Prêt-à-LLOD Project.

Horizon 2020

This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182.