
dc.contributor.author: Wong, May Lai-Yin
dc.date.accessioned: 2020-12-28T09:34:23Z
dc.date.available: 2020-12-28T09:34:23Z
dc.date.issued: 2005
dc.identifier.citation: Anuario del Seminario de Filología Vasca Julio de Urquijo 39(2) : 271-287 (2005)
dc.identifier.issn: 0582-6152
dc.identifier.uri: http://hdl.handle.net/10810/49321
dc.description.abstract: The approach taken in this paper to the construction of a treebank is inspired by skeleton parsing. A sample text of some 100,000 word tokens was chosen from the PFR Chinese Corpus for the production of the treebank. A parsing scheme gives a clear account of the 17 non-terminal constituents that are defined and instantiated in the corpus texts. A set of parsing guidelines is also considered, covering practical issues in mapping parses onto sentences when the parsing scheme is applied. The major difficulties encountered in the course of skeleton parsing are discussed as well, since they illuminate some of the peculiarities of the Chinese language. The paper concludes with an evaluation of the success of the treebank compilation.
dc.language.iso: eng
dc.publisher: Servicio Editorial de la Universidad del País Vasco/Euskal Herriko Unibertsitatearen Argitalpen Zerbitzua
dc.rights: info:eu-repo/semantics/openAccess
dc.title: The compilation of a sample PFR Chinese corpus of Skeleton-parsed sentences
dc.type: info:eu-repo/semantics/article
dc.rights.holder: © 2005, Servicio Editorial de la Universidad del País Vasco Euskal Herriko Unibertsitateko Argitalpen Zerbitzua

