Model Upload & Card Generator

Create model cards and upload fine-tuned models to Hugging Face Hub.

Gather Context

If coming from training manager, you should have:

model_path, base_model, dataset, technique
training_config (LoRA rank, LR, epochs)
final_loss, training_time, hardware

If missing, ask for essential information.

Configuration

1. Repository Settings

Ask for:

Repo name: username/model-name
Visibility: Public or Private
License: MIT, Apache 2.0, CC-BY-4.0, Llama 3 Community, etc.

2. Export Formats

Options:

LoRA adapter only (~50-200MB) - Users merge themselves
Merged 16-bit (15-140GB) - Ready to use
GGUF quantized (4-8GB) - For llama.cpp/Ollama
All of the above (Recommended)

3. GGUF Quantization

If GGUF selected, ask which levels. See references/GGUF_GUIDE.md.

Method	Size	Quality
Q4_K_M	~4GB	Good (Recommended)
Q5_K_M	~5GB	Better
Q8_0	~8GB	Best

Generate Model Card

Create README.md with:

YAML Metadata - license, tags, base_model, datasets
Model Description - Table with key attributes
Training Details - Hyperparameters, LoRA config, results
Usage Examples - Transformers, Unsloth, Ollama, llama.cpp
Intended Use - Primary use cases, out-of-scope
Limitations - Biases, known issues
Citation - BibTeX entry

Execute Upload

1. Create Repository

python

from huggingface_hub import create_repo
create_repo("username/model-name", private=False, exist_ok=True)

2. Upload Files

python

from huggingface_hub import HfApi
api = HfApi()

# LoRA adapter
api.upload_folder(folder_path="./outputs/lora_adapter", repo_id="username/model")

# Model card
api.upload_file(path_or_fileobj="README.md", path_in_repo="README.md", repo_id="username/model")

3. Generate GGUF (if selected)

python

from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained("./outputs/lora_adapter")
model.save_pretrained_gguf("./gguf", tokenizer, quantization_method="q4_k_m")

Use scripts/convert_gguf.py for multiple quantizations.

4. Verify

python

from huggingface_hub import list_repo_files
print(list_repo_files("username/model"))

Final Report

Upload Complete!

Model: https://huggingface.co/{repo_name}

Uploaded:

LoRA adapter

Model card

GGUF files (if selected)

Next steps:

Verify model page

Add example outputs

Run benchmarks

Share on social media

Model Card Best Practices

Be specific about limitations
Include usage examples - copy-pasteable
Document training details
Credit sources - base model, dataset, tools
Use tables - easier to scan

Error Handling

Error	Resolution
Repo exists	Use `exist_ok=True`
Permission denied	Check HF token has write access
Upload timeout	Use chunked upload

Bundled Resources

scripts/convert_gguf.py - GGUF conversion
references/GGUF_GUIDE.md - GGUF details and Ollama setup
references/TROUBLESHOOTING.md - Upload issues

Search AI Tools

funsloth-upload

Install this agent skill to your Project

SKILL.md

Model Upload & Card Generator

Gather Context

Configuration

1. Repository Settings

2. Export Formats

3. GGUF Quantization

Generate Model Card

Execute Upload

1. Create Repository

2. Upload Files

3. Generate GGUF (if selected)

4. Verify

Final Report

Model Card Best Practices

Error Handling

Bundled Resources