Model Card for LLaMat-2

LLaMat-2 is a specialized large language model designed to be a foundational large language model for materials science.


Overview

  • Model Type: Large Language Model (LLM)
  • Base Model: LLaMat-2 (continued pretraining of LLaMA-3 on material science data)
  • Language: English
  • License: LLaMA-3 License
  • Tags: Material Science, Domain Adaptation, Table Understanding, Scientific Data Parsing, Materials Copilot

Model Details

Key Features

  • Applications: Can be finetuned for information extraction, table understanding, parsing data for research tasks, and crystal structure generation.

Development and Support

  • Developed by: M3RG, IIT Delhi & DAIR, IIT Delhi
  • Compute Support:
    • Edinburgh International Data Facility (EIDF): Provided access to Cerebras CS2 clusters for pretraining.
    • IIT Delhi High-Performance Computing Cluster: Supported fine-tuning and inference stages.

Technical Specifications

Hardware Infrastructure

  • Pretraining: 8 NVIDIA A100 80GB GPUs

Software Stack

  • Frameworks: PyTorch, Hugging Face Transformers

Model Sources


Downloads last month
9
Safetensors
Model size
6.74B params
Tensor type
F32
·
Inference API
Unable to determine this model's library. Check the docs .

Model tree for m3rg-iitd/llamat-2

Finetuned
(26)
this model
Finetunes
2 models

Collection including m3rg-iitd/llamat-2