geobrain-ai/geogalactica
Code and datasets for paper "GeoGalactica: A Scientific Large Language Model in Geoscience"
This model helps geoscientists and researchers process vast amounts of specialized text. It takes natural language questions or prompts about geoscience topics and generates relevant, scientifically-grounded text responses. A geoscience researcher or student would use this to quickly synthesize information or understand complex topics.
No commits in the last 6 months.
Use this if you need an AI model specifically trained on geoscience literature to assist with research or information retrieval in that field.
Not ideal if you are looking for a general-purpose language model or one specialized in domains outside of geoscience.
Stars
40
Forks
8
Language
Python
License
Apache-2.0
Category
Last pushed
Jun 27, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/geobrain-ai/geogalactica"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
mlabonne/llm-datasets
Curated list of datasets and tools for post-training.
malteos/llm-datasets
A collection of datasets for language model pretraining including scripts for downloading,...
magpie-align/magpie
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your...
jd-coderepos/llms4subjects
The official SemEval 2025 Task 5 - LLMs4Subjects - Shared Task Dataset repository
willxxy/ECG-Bench
A Unified Framework for Benchmarking Generative Electrocardiogram-Language Models (ELMs)