Instructions to use aab20abdullah/qwen_OSINT with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use aab20abdullah/qwen_OSINT with llama-cpp-python:

# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="aab20abdullah/qwen_OSINT",
	filename="qwen3-4b-thinking-2507.Q4_K_M.gguf",
)

llm.create_chat_completion(
	messages = [
		{
			"role": "user",
			"content": "What is the capital of France?"
		}
	]
)

Notebooks
Google Colab
Kaggle
Local Apps

llama.cpp

How to use aab20abdullah/qwen_OSINT with llama.cpp:

Install from brew

brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf aab20abdullah/qwen_OSINT:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf aab20abdullah/qwen_OSINT:Q4_K_M

Install from WinGet (Windows)

winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf aab20abdullah/qwen_OSINT:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf aab20abdullah/qwen_OSINT:Q4_K_M

Use pre-built binary

# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf aab20abdullah/qwen_OSINT:Q4_K_M
# Run inference directly in the terminal:
./llama-cli -hf aab20abdullah/qwen_OSINT:Q4_K_M

Build from source code

git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf aab20abdullah/qwen_OSINT:Q4_K_M
# Run inference directly in the terminal:
./build/bin/llama-cli -hf aab20abdullah/qwen_OSINT:Q4_K_M

Use Docker

docker model run hf.co/aab20abdullah/qwen_OSINT:Q4_K_M

LM Studio
Jan

vLLM

How to use aab20abdullah/qwen_OSINT with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "aab20abdullah/qwen_OSINT"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "aab20abdullah/qwen_OSINT",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/aab20abdullah/qwen_OSINT:Q4_K_M

Ollama
How to use aab20abdullah/qwen_OSINT with Ollama:
```
ollama run hf.co/aab20abdullah/qwen_OSINT:Q4_K_M
```

Unsloth Studio new

How to use aab20abdullah/qwen_OSINT with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for aab20abdullah/qwen_OSINT to start chatting

Install Unsloth Studio (Windows)

irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for aab20abdullah/qwen_OSINT to start chatting

Using HuggingFace Spaces for Unsloth

# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for aab20abdullah/qwen_OSINT to start chatting

Pi new

How to use aab20abdullah/qwen_OSINT with Pi:

Start the llama.cpp server

# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf aab20abdullah/qwen_OSINT:Q4_K_M

Configure the model in Pi

# Install Pi:
npm install -g @mariozechner/pi-coding-agent
# Add to ~/.pi/agent/models.json:
{
  "providers": {
    "llama-cpp": {
      "baseUrl": "http://localhost:8080/v1",
      "api": "openai-completions",
      "apiKey": "none",
      "models": [
        {
          "id": "aab20abdullah/qwen_OSINT:Q4_K_M"
        }
      ]
    }
  }
}

Run Pi

# Start Pi in your project directory:
pi

Hermes Agent new

How to use aab20abdullah/qwen_OSINT with Hermes Agent:

Start the llama.cpp server

# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf aab20abdullah/qwen_OSINT:Q4_K_M

Configure Hermes

# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default aab20abdullah/qwen_OSINT:Q4_K_M

Run Hermes

hermes

Docker Model Runner
How to use aab20abdullah/qwen_OSINT with Docker Model Runner:
```
docker model run hf.co/aab20abdullah/qwen_OSINT:Q4_K_M
```

Lemonade

How to use aab20abdullah/qwen_OSINT with Lemonade:

Pull the model

# Download Lemonade from https://lemonade-server.ai/
lemonade pull aab20abdullah/qwen_OSINT:Q4_K_M

Run and chat with the model

lemonade run user.qwen_OSINT-Q4_K_M

List all available models

lemonade list

Qwen-OSINT

🎯 Overview

Qwen-OSINT is a specialized large language model fine-tuned from Qwen2.5-7B specifically designed for Open Source Intelligence (OSINT) operations. This model leverages advanced natural language processing capabilities to assist security researchers, analysts, and investigators in gathering, analyzing, and synthesizing information from publicly available sources.

What is OSINT?

Open Source Intelligence (OSINT) refers to the practice of collecting and analyzing information from publicly available sources to support decision-making processes. This includes data from:

🌐 Social media platforms
📰 News articles and publications
🔍 Search engines and databases
💼 Professional networks
🌐 Public records and government databases

✨ Features

Feature	Description
🔎 Advanced Search Analysis	Efficiently analyzes search queries and identifies relevant intelligence sources
📊 Data Synthesis	Consolidates information from multiple sources into coherent summaries
🔐 Security Analysis	Supports threat analysis and vulnerability assessment tasks
📝 Report Generation	Generates structured intelligence reports in various formats
🌐 Multi-language Support	Processes and analyzes content in multiple languages
🛡️ Ethical Compliance	Built with safety guidelines to ensure responsible use

📊 Model Details

Attribute	Value
Base Model	Qwen2.5-7B-Instruct
Framework	Transformers (Hugging Face)
Training Method	Supervised Fine-tuning (SFT)
Vocabulary Size	151,669 tokens
Architecture	Transformer-based Decoder
Precision	FP16 / INT8 compatible

Training Configuration

- Learning Rate: 2e-5
- Batch Size: 8
- Epochs: 3
- Warmup Steps: 100
- Max Sequence Length: 8192

🔧 Installation

Prerequisites

Python >= 3.8
PyTorch >= 2.0
transformers >= 4.35.0
accelerate >= 0.20.0
bitsandbytes >= 0.40.0 (for quantization)

Install Dependencies

pip install transformers torch accelerate bitsandbytes

Download the Model

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "aab20abdullah/qwen_OSINT"

# Download tokenizer
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)

# Download model
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map="auto",
    trust_remote_code=True
)

🚀 Quick Start

Basic Usage

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "aab20abdullah/qwen_OSINT"
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_name, trust_remote_code=True)

def generate_intelligence(prompt, max_length=512):
    messages = [{"role": "user", "content": prompt}]
    text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
    inputs = tokenizer([text], return_tensors="pt").to("cuda")
    
    outputs = model.generate(
        **inputs,
        max_new_tokens=max_length,
        temperature=0.7,
        top_p=0.9
    )
    
    response = tokenizer.decode(outputs[0], skip_special_tokens=True)
    return response.split("assistant")[-1].strip()

# Example
result = generate_intelligence("Analyze the key elements of a threat intelligence report.")
print(result)

Quantized Version (Lower Memory Usage)

from transformers import AutoModelForCausalLM, BitsAndBytesConfig

quantization_config = BitsAndBytesConfig(
    load_in_8bit=True
)

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=quantization_config,
    device_map="auto",
    trust_remote_code=True
)

💡 Usage Examples

Example 1: Search Query Analysis

prompt = """Analyze the following search query and suggest improvements for OSINT research:
Query: "site:linkedin.com cybersecurity analyst" """

result = generate_intelligence(prompt)
print(result)

Example 2: Data Source Evaluation

prompt = """Evaluate the reliability and credibility of the following OSINT sources:
1. Government statistical databases
2. Academic research papers
3. Social media platforms
4. Open-source code repositories"""

result = generate_intelligence(prompt)
print(result)

Example 3: Threat Analysis Framework

prompt = """Using the MITRE ATT&CK framework, analyze potential threat vectors for:
- Phishing attacks
- Network intrusion
- Data exfiltration

Provide recommendations for detection and prevention.""" 

result = generate_intelligence(prompt)
print(result)

🛡️ Ethical Guidelines

⚠️ IMPORTANT: This model is designed for legitimate OSINT research only.

Acceptable Use Cases ✅

🔍 Security research and vulnerability assessment
📊 Threat intelligence analysis
🛡️ Organizational security posture evaluation
📚 Academic research in cybersecurity
🏢 Corporate due diligence

Prohibited Use Cases ❌

🚫 Unauthorized surveillance
🚫 Invasion of privacy
🚫 Harassment or stalking
🚫 Illegal activities
🚫 Content generation for malicious purposes

Responsible Use Principles

Transparency: Clearly identify yourself when conducting OSINT operations
Legality: Ensure compliance with applicable laws and regulations
Proportionality: Collect only information necessary for your objectives
Security: Protect collected data appropriately
Accountability: Maintain records of your OSINT activities

⚠️ Limitations

Limitation	Description
⚡ Computational Resources	Requires GPU with sufficient VRAM for optimal performance
🎯 Accuracy	May generate plausible but incorrect information - always verify
🌍 Language Coverage	Best performance in English; other languages may vary
📅 Knowledge Cutoff	Training data has a knowledge cutoff date
🔒 Sensitive Data	Not designed to handle highly classified or sensitive information

📄 License

This model is released under the Apache 2.0 License.

Copyright 2024 aab20abdullah

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Base Model License

The base model Qwen2.5 is licensed under the Qwen Research License.

🙏 Acknowledgments

Alibaba Cloud - For developing the Qwen2.5 model architecture
Hugging Face - For providing the model hosting infrastructure
Open Source Community - For continuous contributions to AI safety and ethics

📬 Contact

Model Repository: huggingface.co/aab20abdullah/qwen_OSINT
Author: aab20abdullah

📝 Citation

If you use this model in your research or project, please cite:

@model{qwen_osint,
  author = {aab20abdullah},
  title = {Qwen-OSINT: A Specialized Model for Open Source Intelligence},
  year = {2024},
  publisher = {Hugging Face},
  url = {https://huggingface.co/aab20abdullah/qwen_OSINT}
}

⭐ If you find this model useful, please consider giving it a star!

Made with ❤️ for the OSINT community

Downloads last month: 606

GGUF

Model size

4B params

Architecture

qwen3

Hardware compatibility

4-bit

5-bit

8-bit

Model tree for aab20abdullah/qwen_OSINT

Base model

Qwen/Qwen3-4B-Base

Finetuned

Qwen/Qwen3-4B

Quantized

(219)

this model