wangleihitcs/Papers
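Some computer vision papers I have read, covering image-to-text generation, weakly supervised segmentation, and related topics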
This is a curated collection of academic papers and resources on combining computer vision with natural language processing. It helps researchers and practitioners understand and implement models for tasks such as generating descriptions of images and videos, answering questions about visual content, and automatically creating medical reports from imaging data. The listed papers cover models that take images, videos, or questions as input and produce textual descriptions, answers, or reports as output.
125 stars. No commits in the last 6 months.
Use this if you are a researcher or AI practitioner looking for a structured overview of cutting-edge papers and code in fields like image captioning, visual question answering, and medical report generation.
Not ideal if you are a non-technical end-user simply seeking a ready-to-use application or API for these tasks, as this repository primarily links to academic resources.
Stars: 125
Forks: 20
Language: not listed
License: not listed
Category:
Last pushed: May 16, 2020
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/wangleihitcs/Papers"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
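For scripted use, the same request can be made from Python. The sketch below is only a rough equivalent of the curl command above: the helper names `quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path pattern is inferred from the single URL shown here, and it assumes the endpoint returns a JSON body, which this page does not confirm. It makes an unauthenticated request, since how a key is supplied is not described here.

```python
import json
import urllib.request

# Base path taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: other repositories follow the same category/owner/repo
    pattern as the one URL shown on this page.
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the data for one repository without an API key.

    Assumption: the response body is JSON; the schema is not documented here,
    so the parsed value is returned as-is.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Same request as the curl command above.
    print(fetch_quality("nlp", "wangleihitcs", "Papers"))
```

Because keyless access is capped at 100 requests/day, a script that polls many repositories may want to cache results locally rather than refetch them.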
Higher-rated alternatives
Ayanami0730/deep_research_bench
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
Hsankesara/DeepResearch
This repository is the collection of research papers in Deep learning, computer vision and NLP.
QizhiPei/Awesome-Biomolecule-Language-Cross-Modeling
Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging...
thuiar/OKD-Reading-List
Papers for Open Knowledge Discovery
roomylee/nlp-papers-with-arxiv
Statistics and accepted paper list of NLP conferences with arXiv link