Contains the keyword dataset

Gousios G, Vasilescu B, Serebrenik A, Zaidman A. Lean GHTorrent: GitHub Data on Demand. In Proceedings of the 11th Working Conference on Mining Software Repositories [Internet]. New York, NY, USA: ACM; 2014. pp. 384–387. http://doi.acm.org/10.1145/2597073.2597126PDF icon lean-ghtorrent.pdf (766.66 KB)
Squire M. Apache-Affiliated Twitter Screen Names: A Dataset. 10th Working Conference on Mining Software Repositories (MSR2013). 2013. PDF icon apacheTwitterPREPRINT.pdf (262.49 KB)PDF icon MSR presentation.pdf (1.56 MB)