lm_datasets
updated
Viewer
• Updated • 243k • 780
• 220
argilla/OpenHermesPreferences
Viewer
• Updated • 989k • 979
• 214
Viewer
• Updated • 1M • 20.1k
• 834
Viewer
• Updated • 949k • 7.67k
• 492
Viewer
• Updated • 31.1M • 20.4k
• 701
HuggingFaceH4/ultrachat_200k
Viewer
• Updated • 515k • 72.9k
• 710
Viewer
• Updated • 7.5k • 2.48k
• 171
Viewer
• Updated • 5.91k • 163
• 21
Note Code dataset
Viewer
• Updated • 49.6k • 5.08k
• 174
Note Code dataset
code-search-net/code_search_net
Viewer
• Updated • 4.14M • 25.4k
• 329
Note Code dataset
Vezora/Tested-143k-Python-Alpaca
Viewer
• Updated • 143k • 609
• 54
Note Code dataset
haripritam/function-calling-alpaca
Viewer
• Updated • 72.4k • 27
• 1
AndreiMuresanu/alpaca_flan-format
Viewer
• Updated • 51.9k • 6
• 2
Viewer
• Updated • 68.9k • 2.54k
• 146
whitefox44/AlpacaGPT3.5Customized
Viewer
• Updated • 55.8k • 14
• 5
SkyHuReal/DrugBank-Alpaca
Viewer
• Updated • 3.87k • 72
• 11
Viewer
• Updated • 9.44k • 23
• 18
Note Code dataset
llm-wizard/dolly-15k-instruction-alpaca-format
Viewer
• Updated • 15k • 224
• 38
ChobPT/gradio_docs_alpaca
Viewer
• Updated • 2.23k • 24
• 2
QuixiAI/WizardLM_alpaca_evol_instruct_70k_unfiltered
Viewer
• Updated • 55k • 292
• 147
Viewer
• Updated • 333k • 110
• 20
DevAibest/alpaca-geotherm-data
Viewer
• Updated • 643k • 36
haripritam/function-calling-alpaca-rejections
Viewer
• Updated • 87.5k • 7
• 3
V3N0M/Jenna-50K-Alpaca-Uncensored
Viewer
• Updated • 54.4k • 228
• 21
TokenBender/code_instructions_122k_alpaca_style
Viewer
• Updated • 122k • 1.7k
• 80
Note Code dataset
TokenBender/python_evol_instruct_51k
Viewer
• Updated • 51.3k • 212
• 6
Note Code dataset
Viewer
• Updated • 370k • 2.2k
• 25
Note Code dataset
ise-uiuc/Magicoder-OSS-Instruct-75K
Viewer
• Updated • 75.2k • 33.8k
• 164
Note Code dataset
ise-uiuc/Magicoder-Evol-Instruct-110K
Viewer
• Updated • 111k • 26.1k
• 178
Note Code dataset
bigcode/the-stack-v2-train-full-ids
Viewer
• Updated • 60.5M • 952
• 60
Note Code dataset
bigcode/self-oss-instruct-sc2-exec-filter-50k
Viewer
• Updated • 50.7k • 11k
• 106
Note Code dataset
ArmelR/stack-exchange-instruction
Viewer
• Updated • 12.2M • 1.15k
• 71
Note Code dataset
Viewer
• Updated • 546M • 20.8k
• 998
Note Code dataset.
Viewer
• Updated • 207M • 28.9k
• 511
Note Code dataset.
bigcode/bigcode-pii-dataset
Viewer
• Updated • 12.1k • 65
• 55
bigcode/pseudo-labeled-python-data-pii-detection-filtered
Viewer
• Updated • 17.7k • 20
• 7
Viewer
• Updated • 1M • 8.35k
• 892
lmsys/chatbot_arena_conversations
Viewer
• Updated • 33k • 86.7k
• 460
Viewer
• Updated • 20.3k • 8.48k
• 192
HuggingFaceTB/everyday-conversations-llama3.1-2k
Viewer
• Updated • 2.38k • 1.57k
• 130
karpathy/tiny_shakespeare
Updated • 5.62k
• 76
nvidia/Nemotron-Post-Training-Dataset-v2
Viewer
• Updated • 6.34M • 9.39k
• 131
Viewer
• Updated • 476M • 59.4k
• 866