Instructions to use transformers-community/custom_generate_example with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use transformers-community/custom_generate_example with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("transformers-community/custom_generate_example", dtype="auto") - Notebooks
- Google Colab
- Kaggle
| library_name: transformers | |
| tags: | |
| - custom_generate | |
| ## Description | |
| Example repository used to document `generate` from the hub. It is a simplified implementation of greedy decoding. | |
| ## Base model: | |
| `Qwen/Qwen2.5-0.5B-Instruct` | |
| ## Model compatibility | |
| Most models. More specifically, any `transformer` LLM/VLM trained for causal language modeling. | |
| ## Additional Arguments | |
| `left_padding` (`int`, *optional*): number of padding tokens to add before the provided input | |
| ## Output Type changes | |
| (none) | |
| ## Example usage | |
| ```py | |
| from transformers import AutoModelForCausalLM, AutoTokenizer | |
| tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-0.5B-Instruct") | |
| model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B-Instruct", device_map="auto") | |
| inputs = tokenizer(["The quick brown"], return_tensors="pt").to(model.device) | |
| # There is a print message hardcoded in the custom generation method | |
| gen_out = model.generate(**inputs, left_padding=5, custom_generate="transformers-community/custom_generate_example", trust_remote_code=True) | |
| print(tokenizer.batch_decode(gen_out)) # don't skip special tokens | |
| #['<|endoftext|><|endoftext|><|endoftext|><|endoftext|><|endoftext|>The quick brown fox jumps over the lazy dog.\n\nThe sentence "The quick'] | |
| ``` | |