Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

piyush-mk
/
invoiceguard-code

openenv
Model card Files Files and versions
xet
Community
invoiceguard-code / training
85 kB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 24 commits
piyush-mk's picture
piyush-mk
Upload training/train_grpo.py with huggingface_hub
06b7563 verified about 1 month ago
  • README.md
    4.39 kB
    Sync InvoiceGuard code for GRPO training job about 1 month ago
  • __init__.py
    69 Bytes
    Sync InvoiceGuard code for GRPO training job about 1 month ago
  • eval_adapter.py
    6.39 kB
    Fix Qwen3 thinking mode + increase max_new_tokens: training/eval_adapter.py about 1 month ago
  • launch_hf_job.py
    7.23 kB
    Fix Qwen3 thinking mode + increase max_new_tokens: training/launch_hf_job.py about 1 month ago
  • merge_adapter.py
    2.64 kB
    Upload folder using huggingface_hub about 1 month ago
  • rollout.py
    5.71 kB
    Upload folder using huggingface_hub about 1 month ago
  • train_grpo.py
    39 kB
    Upload training/train_grpo.py with huggingface_hub about 1 month ago
  • train_sft.py
    19.6 kB
    v5d: save best-epoch checkpoint during SFT about 1 month ago