PANACEA English V-SUBCAT gold-standard for LAB domain

Instance of: Resource Info
Description This is a domain-specific gold-standard for English subcategorization frames, in the case, for labour (LAB) domain. This gold-standard was manually developed, choosing a set of 29 verbs and 200 senteces for each verb. For each sentence, the SCFs present for the studied verb were manually annotated. The sentences were selected from crawled Web pages that were automatically detected to be in the English language and were automatically classified as relevant to the ENV domain. Data collection took place in the summer of 2011. This gold-standard was created in the context of PANACEA http://www.panacea-lr.eu), an EU-FP7 Funded Project under Grant Agreement 248064.
Language en
Language English
Rights CC-BY-SA
See Also http://metashare.elda.org/repository/browse/d737a72280c211e28763000c291ecfc8d948026400dd4886abf0dd8b2f0f6e4c/
Source META-SHARE
Title PANACEA English V-SUBCAT gold-standard for LAB domain
Type Dataset
Type Lexical Conceptual Resource
Is Is Replaced By of PANACEA English V-SUBCAT gold-standard for LAB domain
PANACEA English V-SUBCAT gold-standard for LAB domain

Contact Point

Affiliation
Communication Info
Address 9 West Road
City Cambridge
Country United Kingdom
Distribution Metashare/d737a72280c211e28763000c291ecfc8d948026400dd4886abf0dd8b2f0f6e4c#Dist URL2
Email alk23@cam.ac.uk
Fax Number +44 0 1223 335062
Telephone Number +44 0 1223 767389
Type Communication Info
Zip Code CB3 9DP
Department Name Department of Theoretical and Applied Linguistics
Organization Name University of Cambridge. Department of Theoretical and Applied Linguistics
Organization Short Name CAM-DTAL
Type Organization Info Type
Communication Info
Address 9 West Road
City Cambridge
Country United Kingdom
Email alk23@cam.ac.uk
Type Communication Info
Zip Code CB3 9DP
Given Name Anna
Surname Korhonen
Type Contact Person
Person
Person Info Type

Distribution Info

Availability Available-unrestricted Use
License
Delivery Channel Downloadable
Same As https://creativecommons.org/licenses/by-sa/4.0/
Type Licence Info
Type Distribution
Distribution Info

Identification Info

Description This is a domain-specific gold-standard for English subcategorization frames, in the case, for labour (LAB) domain. This gold-standard was manually developed, choosing a set of 29 verbs and 200 senteces for each verb. For each sentence, the SCFs present for the studied verb were manually annotated. The sentences were selected from crawled Web pages that were automatically detected to be in the English language and were automatically classified as relevant to the ENV domain. Data collection took place in the summer of 2011. This gold-standard was created in the context of PANACEA http://www.panacea-lr.eu), an EU-FP7 Funded Project under Grant Agreement 248064.
Distribution
Access URL http://hdl.handle.net/10230/20170
Type Distribution
URL
Identifier http://hdl.handle.net/10230/20170
Meta Share Id NOT_DEFINED_FOR_V2
Resource Short Name English V-SUBCAT GS lexicon ENV domain
Title PANACEA English V-SUBCAT gold-standard for LAB domain
Type Identification Info

Lexical Conceptual Resource Info

Lexical Conceptual Resource Media Type
Lexical Conceptual Resource Text Info
Character Encoding Info
Character Encoding UTF-8
Type Character Encoding Info
Domain Info
Domain labour legislation
Type Domain Info
Language Info
Language English
Language en
Language Name English
Type Language Info
Linguality Info
Linguality Type Monolingual
Type Linguality Info
Media Type Text
Size Info
Size 200
Size Unit Sentences
Type Size Info Type
Size 29
Size Unit Entries
Type Size Info Type
Text Format Info
Mime Type text/xml
Type Text Format Info
Type Lexical Conceptual Resource Text Info
Type Lexical Conceptual Resource Media Type
Lexical Conceptual Resource Type Lexicon
Resource Type Lexical Conceptual Resource
Type Lexical Conceptual Resource Info

Resource Creation Info

Creator
Organization Info
Communication Info
Address 9 West Road
City Cambridge
Country United Kingdom
Distribution
Access URL http://www.mml.cam.ac.uk/dtal/
Type Distribution
URL
Email alk23@cam.ac.uk
Fax Number +44 0 1223 335062
Telephone Number +44 0 1223 767389
Type Communication Info
Zip Code CB3 9DP
Department Name Department of Theoretical and Applied Linguistics
Organization Name University of Cambridge. Department of Theoretical and Applied Linguistics
Organization Short Name CAM-DTAL
Type Organization Info Type
Type Actor
Funding Project
Distribution
Access URL http://panacea-lr.eu/
Type Distribution
URL
Funder European Union
Funding Type Eu Funds
Project Name PANACEA
Project Short Name PANACEA
Type Project Info Type
Type Resource Creation Info

Validation Info

Type Validation Info
Validated true Boolean
Validation Mode Automatic
Validation Mode Details The lexicon validates against the LMF DTD v.16
Validation Type Formal

Metashare/d737a72280c211e28763000c291ecfc8d948026400dd4886abf0dd8b2f0f6e4c#metadata Info

Instance of: Catalog Record
Created 2013-01-16 Date
Language en
Language English
Modified 2013-01-16 Date
Original Metadata Schema v3.0
Primary Topic PANACEA English V-SUBCAT gold-standard for LAB domain
Source METASHARE
Type Metadata Info

Creator

Type Actor

Metashare/d737a72280c211e28763000c291ecfc8d948026400dd4886abf0dd8b2f0f6e4c#Header

Instance of: Catalog Record
Issued 2014-09-23T00:16:17Z Date
Primary Topic PANACEA English V-SUBCAT gold-standard for LAB domain
Set Spec lexicalConceptualResource:lexicon
lexicalConceptualResource