CGCL-codes/naturalcc
NaturalCC: An Open-Source Toolkit for Code Intelligence
This toolkit helps software engineering researchers and developers train custom machine learning models to understand and work with code. You feed it code-related datasets, and it helps you create models for tasks like automatically generating code, completing code snippets, summarizing code, or finding similar code sections. It's designed for those who want to build and evaluate advanced code intelligence systems.
316 stars.
Use this if you are a researcher or developer focused on building and evaluating custom machine learning models for various code intelligence tasks.
Not ideal if you are looking for an out-of-the-box application to use directly without model training or customization.
Stars
316
Forks
59
Language
Python
License
MIT
Category
Last pushed
Feb 06, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/CGCL-codes/naturalcc"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
chrismattmann/tika-python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called...
sloria/TextBlob
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase...
cltk/cltk
The Classical Language Toolkit
allenai/scispacy
A full spaCy pipeline and models for scientific/biomedical documents.
wi2trier/cbrkit
Customizable Case-Based Reasoning (CBR) toolkit for Python with a built-in API and CLI.