logpai/bughub
A collection of free-text bug reports for duplicate issue identification
This project provides a comprehensive collection of real-world bug reports from major open-source projects like Mozilla, Firefox, and Eclipse. It helps software engineering researchers and practitioners study how to automatically identify duplicate bug reports using natural language processing. The dataset includes free-text bug descriptions and metadata, categorized for training and testing machine learning models.
123 stars. No commits in the last 6 months.
Use this if you are a software engineering researcher or data scientist looking for diverse, pre-processed datasets of bug reports to develop or benchmark automated duplicate bug detection systems.
Not ideal if you need a tool for real-time bug management in an operational setting or if your primary interest is in bug localization rather than duplicate detection.
Stars
123
Forks
27
Language
—
License
—
Category
Last pushed
Mar 18, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/logpai/bughub"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
MinLee0210/Smart-Evaluation-Solution
A project for AngelHack competition - h4ckhcmc 2024
shadmehr-salehi/AI-Hackathon-2023
Solution for Hackathon Problem Sets ( NLP )
knmlprz/BITEHack
2 miejsce na BITEHack 2022 w kategorii AI
chartes/masterHN-hackathons2026
Résultats de la semaine de compétitions et hackathons du master Humanités numériques - année 2026
ssenichev/hacks-ai-BBBB
Решение кейса хакатона от ЦБ: задача предсказания кредитного рейтинга для пресс-релиза КРА