A Dataset for Pull-based Development Research

Title: A Dataset for Pull-based Development Research
Publication Type: Conference Paper
Year of Publication: 2014
Authors: Gousios, G. and Zaidman, A.
Secondary Title: Proceedings of the 11th Working Conference on Mining Software Repositories
Pagination: 368–371
Publisher: ACM
Place Published: New York, NY, USA
ISBN Number: 978-1-4503-2863-0
Keywords: Distributed software development, Empirical software engineering, msr data showcase, pull request, pull-based development
Abstract

Pull requests form a new method for collaborating in distributed software development. To study the pull request distributed development model, we constructed a dataset of almost 900 projects and 350,000 pull requests, including some of the largest users of pull requests on GitHub. In this paper, we describe how the projects were selected, analyze the selected features, and present a machine learning tool set for the R statistics environment.

URL: http://doi.acm.org/10.1145/2597073.2597122
DOI: 10.1145/2597073.2597122
Full Text: pullreqs-dataset.pdf (PDF, 616.89 KB)