Show simple item record

dc.contributor.authorHernández González, Jerónimo ORCID
dc.contributor.authorRodríguez, Daniel
dc.contributor.authorInza Cano, Iñaki ORCID
dc.contributor.authorHarrison, Rachel
dc.contributor.authorLozano Alonso, José Antonio
dc.date.accessioned2018-11-15T11:16:59Z
dc.date.available2018-11-15T11:16:59Z
dc.date.issued2018-06
dc.identifier.citationData In Brief 18 : 840-845 (2018)es_ES
dc.identifier.issn2352-3409
dc.identifier.urihttp://hdl.handle.net/10810/29671
dc.description.abstractClassifying software defects according to any defined taxonomy is not straightforward. In order to be used for automatizing the classification of software defects, two sets of defect reports were collected from public issue tracking systems from two different real domains. Due to the lack of a domain expert, the collected defects were categorized by a set of annotators of unknown reliability according to their impact from IBM's orthogonal defect classification taxonomy. Both datasets are prepared to solve the defect classification problem by means of techniques of the learning from crowds paradigm (Hernández-González et al. [1]). Two versions of both datasets are publicly shared. In the first version, the raw data is given: the text description of defects together with the category assigned by each annotator. In the second version, the text of each defect has been transformed to a descriptive vector using text-mining techniques.es_ES
dc.description.sponsorshipThis work has been partially supported by the Basque Government(IT609-13,ElkartekBID3A), the Spanish Ministry of Economy and Competitiveness(TIN2016-78365-R) and the University-Society Project15/19(Basque Government and University of the Basque Country UPV/EHU).JoseA.Lozano is also supported by BERC Program 2014–2017(Basque Government) and Severo Ochoa Program SEV- 2013-0323 (Spanish Ministry of Economy and Competitiveness).Daniel Rodriguez carriedo utthis work while visiting Oxford Brookes University.He is partlys upported by projects Badge People TIN2016–76956-C3-3-R.Wewould like to thank Varsha Veerappa and the anony mousannotators for their help with data collection.es_ES
dc.language.isoenges_ES
dc.publisherElsevieres_ES
dc.relationinfo:eu-repo/grantAgreement/MINECO/TIN2016-78365-Res_ES
dc.relationinfo:eu-repo/grantAgreement/MINECO/SEV-2013-0323
dc.rightsinfo:eu-repo/semantics/openAccesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/es/*
dc.titleTwo Datasets of Defect Reports Labeled By a Crowd of Annotators of Unknown Reliabilityes_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.rights.holder2018 The Authors.Published by Elsevier Inc.This is an open access article under the CCBY license (http://creativecommons.org/licenses/by/4.0/).es_ES
dc.rights.holderAtribución 3.0 España*
dc.relation.publisherversionhttps://www.sciencedirect.com/science/article/pii/S2352340918303226?via%3Dihubes_ES
dc.identifier.doi10.1016/j.dib.2018.03.109
dc.departamentoesCiencia de la computación e inteligencia artificiales_ES
dc.departamentoeuKonputazio zientziak eta adimen artifizialaes_ES


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

2018 The Authors.Published by Elsevier Inc.This is an open access article under the CCBY license
(http://creativecommons.org/licenses/by/4.0/).
Except where otherwise noted, this item's license is described as 2018 The Authors.Published by Elsevier Inc.This is an open access article under the CCBY license (http://creativecommons.org/licenses/by/4.0/).