thukg/AMinerOpen
An open source community that focuses on developing and publishing elegant algorithms, models, and tools for scientific big data mining and knowledge intelligence using AMiner resources.
This project provides APIs to work with large academic datasets from AMiner, offering pre-trained word embeddings for Chinese and English scientific texts, tools for analyzing research funding applications, and capabilities for extracting structured information about researchers. It helps academic researchers, data scientists, and organizations working with scientific literature to process and understand vast amounts of publication data and researcher profiles. Users input text (like titles or abstracts) or researcher names, and receive analyzed data such as embeddings, discipline classifications, or structured profile information.
No commits in the last 6 months.
Use this if you need to perform advanced text analysis on large volumes of academic papers, classify research documents, or extract structured information about researchers and their affiliations.
Not ideal if you are looking for a standalone code repository or a tool that doesn't rely on external APIs and large datasets.
Stars: 10
Forks: n/a
Language: n/a
License: MIT
Category:
Last pushed: Jul 27, 2019
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/thukg/AMinerOpen"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
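The same request can be made from a script. The following is a minimal Python sketch using only the standard library, not an official client. It makes several assumptions: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path layout is inferred from the single curl example above, and the response body is assumed to be JSON. The page does not say how a key is supplied, so the sketch makes unauthenticated requests only.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is
# assumed to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper, not part of any SDK)."""
    # Percent-encode each path segment so unusual characters stay valid.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and decode the body, assuming it is JSON."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "thukg", "AMinerOpen")` issues the same GET request as the curl command and counts against the same daily request limit.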
Higher-rated alternatives
dipanjanS/text-analytics-with-python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment...
jonathandunn/text_analytics
Basic text analytics and natural language processing in Python
IBM/watson-document-co-relation
Correlate text content across documents using Watson NLU, Python NLTK and Watson Studio.
Clarifai/clarifai-pyspark
Interfaces for Unstructured data and ML pipelines with Databricks and Clarifai
umer7/Applied-Text-Mining-in-Python
Repo for Applied Text Mining in Python (coursera) by University of Michigan