Rajat Ghosh PRO
rghosh8
AI & ML interests
None yet
Recent Activity
updated a collection 8 days ago
ROBOT-OpenVLA updated a collection 8 days ago
ROBOT-OpenVLA updated a model 8 days ago
rghosh8/openvla-7b-libero-spatialOrganizations
ARC-GRPO
-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
2B • Updated • 192 -
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new
Text Generation • Updated • 1 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4_merged
4B • Updated • 95 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4
Text Generation • Updated
GSM8k-GRPO
-
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16
Text Generation • Updated • 2 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged
Text Generation • 7B • Updated • 171 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-3407-G-16
Text Generation • Updated • 2 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-3407-G-16_merged
Text Generation • 7B • Updated • 85
arc-grpo-baseline
-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-baseline
Text Generation • Updated • 16 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4
Text Generation • Updated • 1 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-16
Text Generation • Updated • 17 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-3407-G-4
Text Generation • Updated • 2
Opencoder-GRPO
-
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4-merged
2B • Updated • 91 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4
Text Generation • Updated • 2 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8
Text Generation • Updated • 2 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
2B • Updated • 71
ROBOT-OpenVLA
arc-grpo-baseline
-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-baseline
Text Generation • Updated • 16 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4
Text Generation • Updated • 1 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-16
Text Generation • Updated • 17 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-3407-G-4
Text Generation • Updated • 2
ARC-GRPO
-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
2B • Updated • 192 -
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new
Text Generation • Updated • 1 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4_merged
4B • Updated • 95 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4
Text Generation • Updated
Opencoder-GRPO
-
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4-merged
2B • Updated • 91 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4
Text Generation • Updated • 2 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8
Text Generation • Updated • 2 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
2B • Updated • 71
GSM8k-GRPO
-
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16
Text Generation • Updated • 2 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged
Text Generation • 7B • Updated • 171 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-3407-G-16
Text Generation • Updated • 2 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-3407-G-16_merged
Text Generation • 7B • Updated • 85