logpai/bughub

A collection of free-text bug reports for duplicate issue identification

38 / 100 (Emerging)

This project collects real-world bug reports from major open-source projects such as Mozilla, Firefox, and Eclipse. It supports software engineering researchers and practitioners who study automatic identification of duplicate bug reports with natural language processing. The dataset includes free-text bug descriptions and metadata, categorized for training and testing machine learning models.

123 stars. No commits in the last 6 months.

Use this if you are a software engineering researcher or data scientist looking for diverse, pre-processed datasets of bug reports to develop or benchmark automated duplicate bug detection systems.

Not ideal if you need a tool for real-time bug management in an operational setting or if your primary interest is in bug localization rather than duplicate detection.
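As a rough illustration of the duplicate-detection task this dataset targets, the sketch below ranks bug reports by TF-IDF cosine similarity. This is a generic baseline, not necessarily a method used by the project; the three sample reports and all function names are hypothetical and are not taken from the dataset.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict of token -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))
    # Smoothed inverse document frequency, always positive.
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in doc_freq.items()}
    return [
        {t: tf * idf[t] for t, tf in Counter(tokens).items()}
        for tokens in tokenized
    ]


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query_index, docs):
    """Index of the document most similar to docs[query_index], excluding itself."""
    vectors = tfidf_vectors(docs)
    scored = [
        (cosine(vectors[query_index], vec), i)
        for i, vec in enumerate(vectors)
        if i != query_index
    ]
    return max(scored)[1]


# Hypothetical reports, invented for illustration only.
reports = [
    "Browser crashes when opening a new tab",
    "Crash on opening new tab in browser",
    "Editor font size setting is ignored after restart",
]
candidate = most_similar(0, reports)
```

Here `candidate` points at the second report, which shares the most vocabulary with the first. A real benchmark would replace the sample list with reports loaded from the dataset and compare the ranking against its labeled duplicates.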

software-engineering-research, bug-tracking, issue-management, quality-assurance, natural-language-processing
No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 20 / 25

Stars: 123
Forks: 27
Language: not listed
License: not listed
Last pushed: Mar 18, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/logpai/bughub"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
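The same request can be made from Python. The sketch below is a minimal example under two assumptions that the page does not confirm: that the endpoint returns JSON, and that the URL path follows a category/owner/repository pattern, as the single URL shown here suggests. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example; the path layout beyond that is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the quality endpoint URL for one repository."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the quality record and parse it, assuming the body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "logpai", "bughub")` issues the same GET request as the curl command. Each call counts against the daily request limit, so caching the result locally is worth considering for repeated use.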