---
license: apache-2.0
tags:
- biology
---
# OneGenome-Rice (OGR)
OGR is a foundational model for AI-driven precision breeding and functional genomics in rice. It is a generative genomic foundation model trained to process DNA sequences up to **1 million** base pairs in length, with **1.25B** total parameters and a **Mixture-of-Experts (MoE)** architecture. It was pre-trained on a curated corpus of **422** rice genomes spanning cultivated and wild *Oryza* diversity.
For instructions, details, and examples, see the project repository [OGR GitHub](https://github.com/zhejianglab/OneGenome-Rice).
The table below summarizes training scale and key hyperparameters.
| Model Specification |
OneGenomeRice (OGR) |
| Model Scale |
| Total Parameters |
1.25B |
| Activated Parameters |
0.33B |
| Architecture |
| Architecture |
MoE |
| Number of Experts |
8 |
| Selected Experts per Token |
2 |
| Number of Layers |
12 |
| Attention Hidden Dimension |
1024 |
| Number of Attention Heads |
16 (GQA, 8 KV groups) |
| MoE Hidden Dimension (per Expert) |
4096 |
| Vocabulary Size |
128 (padded) |
| Context Length |
up to 1Mb |