04-LOCAL LLM LINKS FOR BRAIN
Here is a curated starting set of open-source / open-weight links for building a local autonomous learning-and-teaching LLM stack for a Solarpunk community, with coverage across medicine, biology, agriculture, astronomy, weather, physics, math, engineering, and software. The strongest pattern in current research is to combine an open local base model with domain-specific RAG and specialist datasets rather than relying on one model for everything.[nature]
Core local model stack
-
OLMo repository — fully open language-model research and training code, designed “by scientists, for scientists”.[github]
-
Ollama library — easy local model access; useful for running open-weight models such as OLMo, Meditron, and math-focused models on local hardware.[ollama]
-
llarga — local RAG application for using local or cloud LLMs without needing a vector database.[github]
-
local-LLM-with-RAG — experimental local LLM + RAG sandbox for self-hosted workflows.[github]
-
Ask Astro — end-to-end open-source LLM app architecture reference you can adapt for community knowledge systems.[github]
-
LLM datasets and post-training resources — useful for instruction tuning, math, code, and multilingual training data curation.[github]
Medicine and health
-
Nature: The path forward for large language models in medicine is open — strong argument for transparent and controllable open-source medical LLMs.[nature]
-
PMC article on healthcare LLM deployment — explains why local open LLMs improve customization and privacy in healthcare settings.[pmc.ncbi.nlm.nih]
-
Harvard Medical School summary of open-source AI in tough medical cases — notes an open-source model matching GPT-4 on a clinical reasoning benchmark.[hms.harvard]
-
Meditron via Ollama library — open-source medical LLM adapted from Llama 2 for medical question answering.[ollama]
Biology and biochemistry
-
Awesome-Scientific-Language-Models — curated scientific models list spanning biology, medicine, chemistry, physics, math, and geoscience.[github]
-
Awesome-LLMs-meet-genomes — collection of LLM methods for genomics.[github]
-
Awesome computational biology list — broad bio/chem/omics tooling and models, including biomedical text generation and protein/genome resources.[github]
-
Scientific-LLM-Survey — survey repo focused on biology and chemistry scientific LLMs.[github]
Agriculture and farming
-
Crop science CROP project — crop-science instruction tuning dataset and benchmark for domain-specific agricultural LLMs.[renqichen.github]
-
Smart Agriculture System Using LLMs — open GitHub project for agricultural analytics and recommendations.[github]
-
Cropin Akshara coverage — describes an Apache-licensed open-source Agri LLM for farm advisory use.[agtecher]
-
OpenFarm discussion — community concept for an open crop-intelligence stack.[reddit]
Astronomy and earth systems
-
Earth2Studio — open-source framework for AI weather and climate workflows.[github]
-
Awesome Earth Artificial Intelligence — curated Earth science AI tools, datasets, and tutorials.[github]
-
Ask Astro — also useful as an architectural pattern for science Q&A systems, even though it is Airflow-focused.[github]
Physics and mathematics
-
Awesome-Scientific-Language-Models — includes mathematics and physics model references.[github]
-
Open-source physics knowledge discussions — points to the practical pattern of local model + physics PDFs via RAG.[reddit]
-
Qwen2 Math in Ollama library — math-specialized local model family for symbolic and problem-solving tasks.[ollama]
Engineering and software
-
OLMo repository — a strong base for reproducible scientific and engineering workflows.[github]
-
BentoML open-source LLM overview — explains self-hosted open-weight deployment and model control.[bentoml]
-
LLMWare / open-source RAG frameworks overview — lists RAG frameworks useful for knowledge-grounded engineering assistants.[dev]
Suggested build pattern
A practical Solarpunk setup is: one local base model, one retrieval layer, and many domain knowledge packs. For example, you could run a local open-weight model through Ollama, connect it to a RAG system like llarga, and feed it curated collections for medicine, crop science, weather, physics, and engineering.[github]
A good architecture is:
-
Local model server: Ollama or a self-hosted OLMo-style stack.[github]
-
Retrieval layer: RAG over community documents, manuals, textbooks, and local knowledge.[dev]
-
Domain adapters: medical, crop, biology, math, climate, and software-specific corpora.[renqichen.github]
-
Governance: local-only inference for privacy, plus human review for health and safety-critical domains.[hms.harvard]
Best starting shortlist
If you want the smallest useful starter set, begin with these five:
Would you like me to turn this into a ranked implementation roadmap for a low-cost local Solarpunk deployment?