<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="6.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Fagerholm, Fabian</style></author><author><style face="normal" font="default" size="100%">Taina, Juha</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Collecting data from distributed FOSS projects</style></title><secondary-title><style face="normal" font="default" size="100%">3rd Workshop on Public Data about Software Development (WoPDaSD 2008)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">bitkeeper</style></keyword><keyword><style  face="normal" font="default" size="100%">bug tracking system</style></keyword><keyword><style  face="normal" font="default" size="100%">cvs</style></keyword><keyword><style  face="normal" font="default" size="100%">distributed</style></keyword><keyword><style  face="normal" font="default" size="100%">email archive</style></keyword><keyword><style  face="normal" font="default" size="100%">fork rate</style></keyword><keyword><style  face="normal" font="default" size="100%">git</style></keyword><keyword><style  face="normal" font="default" size="100%">life cycle</style></keyword><keyword><style  face="normal" font="default" size="100%">linux</style></keyword><keyword><style  face="normal" font="default" size="100%">linux kernel</style></keyword><keyword><style  face="normal" font="default" size="100%">mailing list</style></keyword><keyword><style  face="normal" font="default" size="100%">merge rate</style></keyword><keyword><style  face="normal" font="default" size="100%">subversion</style></keyword><keyword><style  face="normal" font="default" size="100%">svn</style></keyword><keyword><style  face="normal" font="default" size="100%">version control</style></keyword></keywords><dates><year><style  
face="normal" font="default" size="100%">2008</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2009</style></date></pub-dates></dates><urls><related-urls><url><style face="normal" font="default" size="100%">http://ifipwg213.org/sites/flosshub.org/files/fagerholm.pdf</style></url></related-urls></urls><pages><style face="normal" font="default" size="100%">8-13</style></pages><abstract><style face="normal" font="default" size="100%">A key trait of Free and Open Source Software (FOSS) development is its distributed nature. Nevertheless, two project-level operations, the fork and the merge of program code, are among the least well understood events in the lifespan of a FOSS project. Some projects have explicitly adopted these operations as the primary means of concurrent development. In this study, we examine the effect of highly distributed software development, as found in the Linux kernel project, on collection and modelling of software development data. We find that distributed development calls for sophisticated temporal modelling techniques where several versions of the source code tree can exist at once. Attention must be turned towards the methods of quality assurance and peer review that projects employ to manage these parallel source trees. Our analysis indicates that two new metrics, fork rate and merge rate, could be useful for determining the role of distributed version control systems in FOSS projects. The study presents a preliminary data set consisting of version control and mailing list data.</style></abstract><notes><style face="normal" font="default" size="100%">&quot;We selected three projects from the initial set of projects: Linux 2.6, an operating system kernel, GIMP, a graphics program, and Blender, a 3D content creation suite.&quot;
&quot;To acquire data from each data source, we wrote special programs based on the earlier prototypes....The first program extracts information from mailing list archives....The second program obtains bug reports from bug tracking systems....The third program obtains source code from network-accessible version control systems and runs metric calculations on it.&quot;</style></notes></record></records></xml>