
Qwen/Qwen3-0.6B - Hugging Face
Qwen3-0.6B has the following features: Type: Causal Language Models; Training Stage: Pretraining & Post-training; Number of Parameters: 0.6B; Number of Paramaters (Non …
Qwen/Qwen3-0.6B-Base - Hugging Face
Scaling Law Guided Hyperparameter Tuning: Through comprehensive scaling law studies across the three-stage pre-training pipeline, Qwen3 systematically tunes critical hyperparameters — …
andresnowak/Qwen3-0.6B-instruction-finetuned - Hugging Face
This model is a fine-tuned version of unsloth/Qwen3-0.6B-Base. It has been trained using TRL. from transformers import pipeline question = "If you had a time machine, but could only go to …
suayptalha/Qwen3-0.6B-IF-Expert - Hugging Face
This project performs full fine-tuning on the Qwen3-0.6B language model to enhance its instruction-following and reasoning capabilities. Training was conducted on the …
rd211/Qwen3-0.6B-Instruct - Hugging Face
This model is a fine-tuned version of Qwen/Qwen3-0.6B on the None dataset. We’re on a journey to advance and democratize artificial intelligence through open source and open science.
unsloth/Qwen3-0.6B-GGUF - Hugging Face
Qwen3-0.6B has the following features: Type: Causal Language Models; Training Stage: Pretraining & Post-training; Number of Parameters: 0.6B; Number of Paramaters (Non …
andresnowak/Qwen3-0.6B-instruction-finetuned_v2 - Hugging Face
This model is a fine-tuned version of unsloth/Qwen3-0.6B-Base. It has been trained using TRL. from transformers import pipeline question = "If you had a time machine, but could only go to …
prithivMLmods/Qwen3-0.6B-ft-bf16 - Hugging Face
Qwen3-0.6B-ft-bf16 is a fine-tuned, moderately abliterated variant based on Qwen3-0.6B, the latest generation of large language models in the Qwen series. This version emphasizes …
suayptalha/Qwen3-0.6B-Code-Expert - Hugging Face
This project performs full fine-tuning on the Qwen3-0.6B language model to enhance its code reasoning and generation capabilities. Training was conducted exclusively on the …
README.md · Qwen/Qwen3-0.6B-Base at main - Hugging Face
Qwen3-0.6B-Base has the following features: Type: Causal Language Models; Training Stage: Pretraining; Number of Parameters: 0.6B; Number of Paramaters (Non-Embedding): 0.44B; …
taki555/Qwen3-0.6B-Shadow-FT-BAAI-2k - Hugging Face
$\Rightarrow$ We propose the Shadow-FT framework to tune the INSTRUCT models by leveraging the corresponding BASE models. The key insight is to fine-tune the BASE model, …
unsloth/Qwen3-0.6B-unsloth-bnb-4bit · Hugging Face
Run & export your fine-tuned model to Ollama, llama.cpp or HF. Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture …