wangleihitcs/Papers

Some computer vision papers I have read, covering image-to-text generation, weakly supervised segmentation, and more

/ 100

Emerging

This is a curated collection of academic papers and resources on combining computer vision with natural language processing. It helps researchers and practitioners understand and implement models for tasks such as generating descriptions of images and videos, answering questions about visual content, and automatically writing medical reports from imaging data. The collection covers both what these models take as input (images, videos, questions) and what they produce (textual descriptions, answers, reports).

125 stars. No commits in the last 6 months.

Use this if you are a researcher or AI practitioner looking for a structured overview of cutting-edge papers and code in fields like image captioning, visual question answering, and medical report generation.

Not ideal if you are a non-technical end-user simply seeking a ready-to-use application or API for these tasks, as this repository primarily links to academic resources.

medical-imaging image-captioning visual-question-answering medical-report-generation computer-vision-research

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 17 / 25


Stars: 125

Forks:

Language: —

License: —

Higher-rated alternatives

Ayanami0730/deep_research_bench

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Hsankesara/DeepResearch

This repository is the collection of research papers in Deep learning, computer vision and NLP.

QizhiPei/Awesome-Biomolecule-Language-Cross-Modeling

Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging...

thuiar/OKD-Reading-List

Papers for Open Knowledge Discovery

roomylee/nlp-papers-with-arxiv

Statistics and accepted paper list of NLP conferences with arXiv link
