Overview

GAIA (Guided Abiogenesis Inquiry Assistant) is a 400 million parameter conversational language model that specializes in the field of abiogenesis. Its intended purpose is to provide information related to the origin of life.

Dataset

The model is trained mostly on English text, on over 6244 rows of mixed data spanning multiple science fields. The dataset consists mainly of abiogenesis, biology, chemistry and physics knowledge. The abiogenesis part is scraped from trusted and reputable sources such as Wikipedia, PubMed and ResearchGate. The biology, chemistry and physics portions are each 33% of larger synthetic datasets provided by CAMEL AI.
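As a rough illustration of how such a mix could be assembled, the sketch below takes a reproducible 33% sample from each of three source datasets and combines them with the scraped rows. It assumes that "33% each" means a 33% random sample of every larger dataset; the row contents, dataset sizes, seed, and the helper name `sample_fraction` are all hypothetical and not taken from the actual GAIA pipeline.

```python
import random


def sample_fraction(rows, fraction, seed=0):
    """Return a reproducible random sample holding `fraction` of `rows`."""
    rng = random.Random(seed)
    k = round(len(rows) * fraction)
    return rng.sample(rows, k)


# Hypothetical stand-ins for the larger synthetic datasets.
sources = {
    "biology": [f"bio-{i}" for i in range(300)],
    "chemistry": [f"chem-{i}" for i in range(300)],
    "physics": [f"phys-{i}" for i in range(300)],
}

# Hypothetical stand-in for the scraped abiogenesis rows.
abiogenesis = [f"abio-{i}" for i in range(100)]

# Combine the scraped rows with a 33% sample of each synthetic dataset.
mixed = list(abiogenesis)
for name, rows in sources.items():
    mixed.extend(sample_fraction(rows, 0.33))

# Shuffle so that the fields are interleaved during training.
random.Random(0).shuffle(mixed)
print(len(mixed))
```

Fixing the seed keeps the sample reproducible, which makes it easier to compare training runs on the same mix.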

Training information

  • Fine-tuned
  • Trained for 50 epochs
  • Weight decay = 0.001
  • Early stopping implemented to prevent overfitting
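The settings above can be illustrated with a minimal early-stopping sketch. The card does not state which stopping criterion was used, so the patience of 3 epochs, the `min_delta` threshold, and the validation losses below are assumptions chosen only for demonstration; the weight decay value is shown as a constant that would normally be passed to the optimizer.

```python
MAX_EPOCHS = 50        # upper bound on training length, from the card
WEIGHT_DECAY = 0.001   # from the card; would be handed to the optimizer


class EarlyStopping:
    """Stop when validation loss has not improved for `patience` epochs."""

    def __init__(self, patience=3, min_delta=0.0):
        self.patience = patience      # assumed value, not from the card
        self.min_delta = min_delta    # assumed value, not from the card
        self.best = float("inf")
        self.bad_epochs = 0

    def step(self, val_loss):
        """Record one epoch's loss; return True when training should stop."""
        if val_loss < self.best - self.min_delta:
            self.best = val_loss
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
        return self.bad_epochs >= self.patience


def run(val_losses, patience=3):
    """Walk through per-epoch validation losses, stopping early if needed."""
    stopper = EarlyStopping(patience)
    for epoch, loss in enumerate(val_losses[:MAX_EPOCHS], start=1):
        if stopper.step(loss):
            return epoch, stopper.best
    return min(len(val_losses), MAX_EPOCHS), stopper.best


# Made-up validation losses: improvement stalls after epoch 4.
losses = [2.0, 1.5, 1.2, 1.1, 1.15, 1.12, 1.3, 1.4]
stopped_at, best = run(losses)
print(stopped_at, best)
```

With these illustrative losses, training halts well before the 50-epoch limit, once three consecutive epochs fail to beat the best validation loss.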

Inference results

GAIA achieved an acceptable balance between bias and variance.


USER: Who are you and how can you help me?

GAIA: I am GAIA, the digital entity designed to aid in the exploration of life's origins.


USER: What is the role of CDK in the regulation of the cell during mitosis?

GAIA: CDK is a critical cellular component of the cytosine base in living cells. It is involved in the formation of a membrane in the presence of a strong acid, such as sulfuric acid (H2SO4) or phosphorus pentafluoride (H3PO4). When it loses a proton (H+) from the nucleus, it forms a peptide ion, which is then dissociated into its respective ions


USER: Can you explain to me Panspermia

GAIA: Pansermia is an autosomal recessive genetic disorder caused by a mutation in the DNA of an organism. It affects the formation and functioning of proteins, leading to various cellular processes, including cell death, apoptosis, and post-transmission. The disease is caused by mutations in genes located at the ends of chromosomes that protect them from degradation, fusion, and recombination during early embryonic development.


USER: Explain to me the chemiosmotic gradient

GAIA: The chemiosmootic gradiency of a molecule or ion can be represented as: - Phosphorus (Phenyl) = Σ(-OH) + (2-methylbutane) Where ΔH is the change in electronegativity of the ion.
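Exchanges like the ones above could be reproduced with the Hugging Face `transformers` library, roughly as sketched below. Several things here are assumptions rather than documented facts: that the checkpoint loads with `AutoModelForCausalLM`, that the prompt follows the `USER:` / `GAIA:` layout seen in the transcripts, and the generation length. The repository id `lyubomirr/GAIA` is taken from the model page; the helper names are hypothetical.

```python
REPO_ID = "lyubomirr/GAIA"  # repository id as shown on the model page


def build_prompt(question: str) -> str:
    """Format a question in the USER/GAIA layout (assumed prompt format)."""
    return f"USER: {question.strip()}\nGAIA:"


def generate_reply(question: str, max_new_tokens: int = 128) -> str:
    """Load the model and generate one reply.

    Assumes the checkpoint is a causal language model; `transformers` and
    `torch` must be installed, and the weights are downloaded on first use.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(REPO_ID)
    model = AutoModelForCausalLM.from_pretrained(REPO_ID)

    inputs = tokenizer(build_prompt(question), return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (downloads the weights, so it is left commented out):
# print(generate_reply("Can you explain to me Panspermia"))
```

Keeping the prompt formatting in its own function makes it easy to adjust if the model turns out to expect a different layout.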

Safetensors · Model size: 365M params · Tensor type: F32
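The figures listed above (365M parameters stored as F32) imply a rough size for the weights alone. The short calculation below is a back-of-the-envelope estimate; it ignores activations, optimizer state, and file-format overhead, and the F16 row is included only for comparison.

```python
# Bytes needed to store one parameter in each tensor type.
BYTES_PER_PARAM = {"F32": 4, "F16": 2}


def weight_footprint_bytes(n_params: int, dtype: str = "F32") -> int:
    """Return the approximate size of the raw weights in bytes."""
    return n_params * BYTES_PER_PARAM[dtype]


N_PARAMS = 365_000_000  # rounded figure from the model page

size_f32 = weight_footprint_bytes(N_PARAMS, "F32")
size_f16 = weight_footprint_bytes(N_PARAMS, "F16")
print(f"F32: {size_f32 / 1e9:.2f} GB")
print(f"F16: {size_f16 / 1e9:.2f} GB")
```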

Datasets used to train lyubomirr/GAIA