About 710,000 results
Open links in new tab
  1. Qwen/Qwen3-0.6B - Hugging Face

    Qwen3-0.6B has the following features: Type: Causal Language Models; Training Stage: Pretraining & Post-training; Number of Parameters: 0.6B; Number of Paramaters (Non …

  2. Qwen/Qwen3-0.6B-Base - Hugging Face

    Scaling Law Guided Hyperparameter Tuning: Through comprehensive scaling law studies across the three-stage pre-training pipeline, Qwen3 systematically tunes critical hyperparameters — …

  3. andresnowak/Qwen3-0.6B-instruction-finetuned - Hugging Face

    This model is a fine-tuned version of unsloth/Qwen3-0.6B-Base. It has been trained using TRL. from transformers import pipeline question = "If you had a time machine, but could only go to …

  4. suayptalha/Qwen3-0.6B-IF-Expert - Hugging Face

    This project performs full fine-tuning on the Qwen3-0.6B language model to enhance its instruction-following and reasoning capabilities. Training was conducted on the …

  5. rd211/Qwen3-0.6B-Instruct - Hugging Face

    This model is a fine-tuned version of Qwen/Qwen3-0.6B on the None dataset. We’re on a journey to advance and democratize artificial intelligence through open source and open science.

  6. unsloth/Qwen3-0.6B-GGUF - Hugging Face

    Qwen3-0.6B has the following features: Type: Causal Language Models; Training Stage: Pretraining & Post-training; Number of Parameters: 0.6B; Number of Paramaters (Non …

  7. andresnowak/Qwen3-0.6B-instruction-finetuned_v2 - Hugging Face

    This model is a fine-tuned version of unsloth/Qwen3-0.6B-Base. It has been trained using TRL. from transformers import pipeline question = "If you had a time machine, but could only go to …

  8. prithivMLmods/Qwen3-0.6B-ft-bf16 - Hugging Face

    Qwen3-0.6B-ft-bf16 is a fine-tuned, moderately abliterated variant based on Qwen3-0.6B, the latest generation of large language models in the Qwen series. This version emphasizes …

  9. suayptalha/Qwen3-0.6B-Code-Expert - Hugging Face

    This project performs full fine-tuning on the Qwen3-0.6B language model to enhance its code reasoning and generation capabilities. Training was conducted exclusively on the …

  10. README.md · Qwen/Qwen3-0.6B-Base at main - Hugging Face

    Qwen3-0.6B-Base has the following features: Type: Causal Language Models; Training Stage: Pretraining; Number of Parameters: 0.6B; Number of Paramaters (Non-Embedding): 0.44B; …

  11. taki555/Qwen3-0.6B-Shadow-FT-BAAI-2k - Hugging Face

    $\Rightarrow$ We propose the Shadow-FT framework to tune the INSTRUCT models by leveraging the corresponding BASE models. The key insight is to fine-tune the BASE model, …

  12. unsloth/Qwen3-0.6B-unsloth-bnb-4bit · Hugging Face

    Run & export your fine-tuned model to Ollama, llama.cpp or HF. Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture …

Refresh