Instructions to use InvestmentResearchAI/LLM-ADE_tiny-v0.001 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use InvestmentResearchAI/LLM-ADE_tiny-v0.001 with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="InvestmentResearchAI/LLM-ADE_tiny-v0.001")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("InvestmentResearchAI/LLM-ADE_tiny-v0.001")
model = AutoModelForCausalLM.from_pretrained("InvestmentResearchAI/LLM-ADE_tiny-v0.001")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use InvestmentResearchAI/LLM-ADE_tiny-v0.001 with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "InvestmentResearchAI/LLM-ADE_tiny-v0.001"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "InvestmentResearchAI/LLM-ADE_tiny-v0.001",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/InvestmentResearchAI/LLM-ADE_tiny-v0.001

SGLang

How to use InvestmentResearchAI/LLM-ADE_tiny-v0.001 with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "InvestmentResearchAI/LLM-ADE_tiny-v0.001" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "InvestmentResearchAI/LLM-ADE_tiny-v0.001",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "InvestmentResearchAI/LLM-ADE_tiny-v0.001" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "InvestmentResearchAI/LLM-ADE_tiny-v0.001",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use InvestmentResearchAI/LLM-ADE_tiny-v0.001 with Docker Model Runner:
```
docker model run hf.co/InvestmentResearchAI/LLM-ADE_tiny-v0.001
```

LLM-ADE_tiny-v0.001

Commit History

Upload LlamaForCausalLM

f3bf0b3
verified

WilliamGazeley commited on Jun 28, 2024

Update config.json

87b6a8d
verified

WilliamGazeley commited on Jun 19, 2024

Update prompt template in examples

93e33c7
verified

WilliamGazeley commited on Jun 19, 2024

Upload LlamaForCausalLM

b587cfc
verified

WilliamGazeley commited on Jun 19, 2024

Upload tokenizer

ad8f797
verified

WilliamGazeley commited on Jun 19, 2024

Update README.md

87fd284
verified

stepchoi commited on Apr 22, 2024

Update README.md

04a29db
verified

stepchoi commited on Apr 17, 2024

Update README.md

6e52f42
verified

stepchoi commited on Apr 16, 2024

Update README.md

1a24ccd
verified

stepchoi commited on Apr 16, 2024

Update README.md

3b401dd
verified

Stephen Choi commited on Apr 9, 2024

Update README.md

ec82a77
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

e8e1f22
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

8e10f98
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

5f575a5
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

ee16f19
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

77c70a4
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

0153fb7
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

0e685e7
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

ca70274
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

fab6e3d
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

dec8831
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

68db6f8
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

ee1e121
verified

WilliamGazeley commited on Apr 4, 2024

Update config.json

54779dd
verified

WilliamGazeley commited on Apr 4, 2024

Upload LlamaForCausalLM

da75eb2
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

a487be2
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

95791cf
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

a748257
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

1382b54
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

e2a6ce3
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

786de25
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

9480619
verified

WilliamGazeley commited on Apr 4, 2024

Upload tokenizer

61d68d7
verified

WilliamGazeley commited on Apr 4, 2024

Fix pathing issue

cfa0a0c
verified

WilliamGazeley commited on Apr 4, 2024

Update README.md

6a11c9a
verified

WilliamGazeley commited on Apr 4, 2024

Upload LlamaForCausalLM

e103f94
verified

WilliamGazeley commited on Apr 4, 2024

initial commit

3cdd624
verified

WilliamGazeley commited on Apr 4, 2024

Commit History

Upload LlamaForCausalLM f3bf0b3 verified

Update config.json 87b6a8d verified

Update prompt template in examples 93e33c7 verified

Upload LlamaForCausalLM b587cfc verified

Upload tokenizer ad8f797 verified

Update README.md 87fd284 verified

Update README.md 04a29db verified

Update README.md 6e52f42 verified

Update README.md 1a24ccd verified

Update README.md 3b401dd verified

Update README.md ec82a77 verified

Update README.md e8e1f22 verified

Update README.md 8e10f98 verified

Update README.md 5f575a5 verified

Update README.md ee16f19 verified

Update README.md 77c70a4 verified

Update README.md 0153fb7 verified

Update README.md 0e685e7 verified

Update README.md ca70274 verified

Update README.md fab6e3d verified

Update README.md dec8831 verified

Update README.md 68db6f8 verified

Update README.md ee1e121 verified

Update config.json 54779dd verified

Upload LlamaForCausalLM da75eb2 verified

Update README.md a487be2 verified

Update README.md 95791cf verified

Update README.md a748257 verified

Update README.md 1382b54 verified

Update README.md e2a6ce3 verified

Update README.md 786de25 verified

Update README.md 9480619 verified

Upload tokenizer 61d68d7 verified

Fix pathing issue cfa0a0c verified

Update README.md 6a11c9a verified

Upload LlamaForCausalLM e103f94 verified

initial commit 3cdd624 verified

Upload LlamaForCausalLM

f3bf0b3
verified

Update config.json

87b6a8d
verified

Update prompt template in examples

93e33c7
verified

Upload LlamaForCausalLM

b587cfc
verified

Upload tokenizer

ad8f797
verified

Update README.md

87fd284
verified

Update README.md

04a29db
verified

Update README.md

6e52f42
verified

Update README.md

1a24ccd
verified

Update README.md

3b401dd
verified

Update README.md

ec82a77
verified

Update README.md

e8e1f22
verified

Update README.md

8e10f98
verified

Update README.md

5f575a5
verified

Update README.md

ee16f19
verified

Update README.md

77c70a4
verified

Update README.md

0153fb7
verified

Update README.md

0e685e7
verified

Update README.md

ca70274
verified

Update README.md

fab6e3d
verified

Update README.md

dec8831
verified

Update README.md

68db6f8
verified

Update README.md

ee1e121
verified

Update config.json

54779dd
verified

Upload LlamaForCausalLM

da75eb2
verified

Update README.md

a487be2
verified

Update README.md

95791cf
verified

Update README.md

a748257
verified

Update README.md

1382b54
verified

Update README.md

e2a6ce3
verified

Update README.md

786de25
verified

Update README.md

9480619
verified

Upload tokenizer

61d68d7
verified

Fix pathing issue

cfa0a0c
verified

Update README.md

6a11c9a
verified

Upload LlamaForCausalLM

e103f94
verified

initial commit

3cdd624
verified