wangleihitcs/Papers

读过的CV方向的一些论文,图像生成文字、弱监督分割等

35
/ 100
Emerging

This is a curated collection of academic papers and resources focused on integrating computer vision with natural language processing. It helps researchers and practitioners understand and implement advanced AI models for tasks like generating descriptions for images and videos, answering questions about visual content, and automatically creating medical reports from imaging data. It provides insights into what goes into these models (images, videos, questions) and what comes out (textual descriptions, answers, reports).

125 stars. No commits in the last 6 months.

Use this if you are a researcher or AI practitioner looking for a structured overview of cutting-edge papers and code in fields like image captioning, visual question answering, and medical report generation.

Not ideal if you are a non-technical end-user simply seeking a ready-to-use application or API for these tasks, as this repository primarily links to academic resources.

medical-imaging image-captioning visual-question-answering medical-report-generation computer-vision-research
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 17 / 25

How are scores calculated?

Stars

125

Forks

20

Language

License

Last pushed

May 16, 2020

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/wangleihitcs/Papers"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.