Access the full text.
Sign up today, get DeepDyve free for 14 days.
A. Ghapanchi (2015)
Investigating the Interrelationships among Success Measures of Open Source Software ProjectsJournal of Organizational Computing and Electronic Commerce, 25
Jehad Dallal, L. Briand (2012)
A Precise Method-Method Interaction-Based Cohesion Metric for Object-Oriented ClassesACM Trans. Softw. Eng. Methodol., 21
Ju Long (2009)
Open Source Software Development Experiences on the Students' Resumes: Do They Count? - Insights from the Employers' PerspectivesJ. Inf. Technol. Educ., 8
Georgios Gousios, D. Spinellis (2017)
Mining Software Engineering Data from GitHub2017 IEEE/ACM 39th International Conference on Software Engineering Companion (ICSE-C)
Communications of AIS, 2005
J. Howison, Megan Conklin, Kevin Crowston (2006)
FLOSSmole: A Collaborative Repository for FLOSS Research Data and AnalysesInt. J. Inf. Technol. Web Eng., 1
Elyazid Akachar, B. Ouhbi, B. Frikh (2019)
A new algorithm for detecting communities in social networks based on content and structure informationInt. J. Web Inf. Syst., 16
C. Bird, Peter Rigby, Earl Barr, David Hamilton, D. Germán, Premkumar Devanbu (2009)
The promises and perils of mining git2009 6th IEEE International Working Conference on Mining Software Repositories
K. Stewart, Anthony Ammeter, Likoebe Maruping (2006)
Impacts of License Choice and Organizational Sponsorship on User Interest and Development Activity in Open Source Software ProjectsInf. Syst. Res., 17
Caius Brindescu, Mihai Codoban, Sergii Shmarkatiuk, Danny Dig (2014)
How do centralized and distributed version control systems impact software changes?Proceedings of the 36th International Conference on Software Engineering
Georgios Gousios (2013)
The GHTorent dataset and tool suite2013 10th Working Conference on Mining Software Repositories (MSR)
Carlos Santos, G. Kuk, Fabio Kon, J. Pearson (2013)
The attraction of contributors in free and open source software projectsJ. Strateg. Inf. Syst., 22
T. Malone, Kevin Crowston (1994)
The interdisciplinary study of coordinationACM Comput. Surv., 26
J. Sutanto, A. Kankanhalli, B. Tan (2014)
Uncovering the relationship between OSS user support networks and OSS popularityDecis. Support Syst., 64
Mohammad Al-Marzouq, V. Grover, J. Thatcher (2015)
Taxing the development structure of open source communities: An information processing viewDecis. Support Syst., 80
M. AlMarzouq, A. AlZaidan, J. AlDallal (2020)
An exploration of free/libra and open source data sources and their use in the field of information systems
(2008)
Advances in the sourceforge research data archive
Yonghee Shin, Andrew Meneely, L. Williams, J. Osborne (2011)
Evaluating Complexity, Code Churn, and Developer Activity Metrics as Indicators of Software VulnerabilitiesIEEE Transactions on Software Engineering, 37
T. Alspaugh, W. Scacchi, Hazeline Asuncion (2010)
Software Licenses in Context: The Challenge of Heterogeneously-Licensed SystemsJ. Assoc. Inf. Syst., 11
K. Stewart, S. Gosain (2006)
The Impact of Ideology on Effectiveness in Open Source Software Development TeamsMIS Q., 30
F. Ennaji, A. Fazziki, H. Abdallaoui, D. Benslimane, Mohamed Sadgal (2019)
A product reputation framework based on social multimedia contentInt. J. Web Inf. Syst., 16
Jungpil Hahn, J. Moon, C. Zhang (2008)
Emergence of New Project Teams from Open Source Software Developer Networks: Impact of Prior Collaboration TiesInf. Syst. Res., 19
Mohammad Al-Marzouq, Li Zheng, Guang Rong, V. Grover (2005)
Open Source: Concepts, Benefits, and ChallengesCommun. Assoc. Inf. Syst., 16
Kevin Crowston, J. Howison, Hala Annabi (2006)
Information systems success in free and open source software development: theory and measuresSoftw. Process. Improv. Pract., 11
Rajiv Krishnamurthy, V. Jacob, S. Radhakrishnan, Kutsal Dogan (2016)
Peripheral Developer Participation in Open Source ProjectsACM Transactions on Management Information Systems (TMIS), 6
Nachiappan Nagappan, T. Ball (2007)
Using Software Dependencies and Churn Metrics to Predict Field Failures: An Empirical Case StudyFirst International Symposium on Empirical Software Engineering and Measurement (ESEM 2007)
Satnam Kaur, Paramvir Singh (2019)
How does Object-Oriented Code Refactoring Influence Software Quality? Research Landscape and ChallengesJ. Syst. Softw., 157
Rajdeep Grewal, G. Lilien, Girish Mallapragada (2006)
Location, Location, Location: How Network Embeddedness Affects Project Success in Open Source SystemsManag. Sci., 52
W. Wen, Chris Forman, Stuart Graham (2013)
Research Note - The Impact of Intellectual Property Rights Enforcement on Open Source Software Project SuccessInf. Syst. Res., 24
Sebastião Neto, S. Dias, R. Missaoui, Luis Zárate, Mark Song (2018)
Identification of substructures in complex networks using formal concept analysisInt. J. Web Inf. Syst., 14
International Journal of Information Technology and Web Engineering (IJITWE), 1
ACM Transactions on Management Information Systems, 6
Eirini Kalliamvakou, Georgios Gousios, Kelly Blincoe, Leif Singer, D. Germán, D. Damian (2016)
An in-depth study of the promises and perils of mining GitHubEmpirical Software Engineering, 21
Ruth Anderson, Michael Ernst, R. Ordóñez, Paul Pham, Ben Tribelhorn (2015)
A Data Programming CS1 CourseProceedings of the 46th ACM Technical Symposium on Computer Science Education
This study aims to highlight the challenges and opportunities of using GitHub as a data source in both research and programming education.Design/methodology/approachThis study provides general overview of the challenges and opportunities faced while conducting empirical research using GitHub as a data source. The challenges and opportunities are framed using the input–process–output model of open-source software.FindingsGitHub data accessed from the application programming interface (API) can have several limitations, which can be overcome by Web scraping and using external data repositories such as GHArchive and GHTorrent. There are also several idiosyncrasies about GitHub that researchers need to be aware of to be able to use the data effectively, which can represent an opportunity for research. The challenges and opportunities are summarized for the licenses, community, development process and product of free/libra and open-source software communities hosted on GitHub.Originality/valueThis study provides a summary of GitHub-related challenges and opportunities that researchers can leverage to improve their empirical research. Furthermore, this summary can be a valuable resource for instructors that plan to use GitHub as a data source in their data-focused programming courses.
International Journal of Web Information Systems – Emerald Publishing
Published: Oct 8, 2020
Keywords: Communities on the Web; Web mining; Data mining; Data sources; Open source; Applications of Web mining and searching; Web-based education; GitHub; Web scraping; Cloud platform
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.