OpenBuddy Community

community

AI & ML interests

Community of researchers interested in OpenBuddy Early Access. Please note that this is not an official group for OpenBuddy, and the members have no affiliation with the OpenBuddy team. Every independent researchers can apply to join by submitting a Form available in our GitHub.

raincandy-u

posted an update 5 months ago

Post

3108

Introducing Rain-v2: Democratizing LLM training on gaming GPUs! ⚡

Following Rain-100M, we’re scaling up. Rain-v2 features a larger training dataset.

We’ve published a comprehensive blog covering the end-to-end journey—from raw data collection to rigorous evaluation and safety testing.

HF Repo: 🤗 raincandy-u/Rain-v2

Blog: 📚
https://angelkawaii.xyz/2026/01/29/rain-v2/

Special thanks to the open-source community and the SmolLM2 team for their foundational work! 🚀

HuggingFaceTB
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model (2502.02737)

raincandy-u

posted an update 5 months ago

Post

5713

🤗 Just released Rain-100M, an experimental ~97M-parameter Qwen3-style language model trained from random initialization.

Repo: raincandy-u/Rain-100M

Data: HuggingFaceFW/fineweb-edu, ~3B tokens, English only

Tokenizer: custom 16k BPE, context length 4096

Architecture: 12 Transformer layers, hidden size 768, 12 heads, MLP 2048, SiLU, bf16

Rain-100M is a raw base model (not instruction-tuned or safety-aligned), aimed at small-scale research, debugging training pipelines, and CPU/edge experiments. If you run evaluations, finetunes, or visualizations with it, I would be very interested in your results!

3 replies

AtAndDev

posted an update 11 months ago

Post

700

Qwen 3 Coder is a personal attack to k2, and I love it.
It achieves near SOTA on LCB while not having reasoning.
Finally people are understanding that reasoning isnt necessary for high benches...

Qwen ftw!

DECENTRALIZE DECENTRALIZE DECENTRALIZE

AtAndDev

posted an update about 1 year ago

Post

3179

deepseek-ai/DeepSeek-R1-0528

This is the end

1 reply

AtAndDev

posted an update about 1 year ago

Post

3157

Llama 4 is out...

3 replies

AtAndDev

posted an update over 1 year ago

Post

4394

There seems to multiple paid apps shared here that are based on models on hf, but some ppl sell their wrappers as "products" and promote them here. For a long time, hf was the best and only platform to do oss model stuff but with the recent AI website builders anyone can create a product (really crappy ones btw) and try to sell it with no contribution to oss stuff. Please dont do this, or try finetuning the models you use...
Sorry for filling yall feed with this bs but yk...

6 replies

AtAndDev

posted an update over 1 year ago

Post

1682

Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.

AtAndDev

posted an update over 1 year ago

Post

2508

@nroggendorff is that you sama?

2 replies

osmedi

authored a paper over 1 year ago

Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation

Paper • 2411.18335 • Published Nov 27, 2024 • 3

AtAndDev

posted an update over 1 year ago

Post

1957

everywhere i go i see his face

AtAndDev

posted an update over 1 year ago

Post

591

Deepseek gang on fire fr fr

AtAndDev

posted an update over 1 year ago

Post

1667

R1 is out! And with a lot of other R1 releated models...

AtAndDev

posted an update over 1 year ago

Post

505

@s3nh Hey man check your discord! Got some news.

4 replies

Niansuh

posted an update almost 2 years ago

Post

4006

Use GPT-4, GPT-4 Turbo Preview, GPT-3.5 Turbo, BingAI, and other models. The interface is similar to ChatGPT, with a speedy API endpoint.

https://huggingface.co/spaces/NiansuhAI/Copilot

4 replies

Niansuh

posted an update about 2 years ago

Post

2209

Use GPT-4o + GPT-4-Turbo-Preview + GPT-3.5-Turbo + BingAI

https://huggingface.co/spaces/NiansuhAI/Copilot

2 replies

raincandy-u

posted an update about 2 years ago

Post

2691

🤗 I trained what is probably the smallest (600k ~) TinyStories model! It really can write grammatically correct stories!

raincandy-u/TinyStories-656K

Try this space based on this minuscule model!

https://huggingface.co/spaces/raincandy-u/Story-Teller

Edit: Moreover, the model weight size is only 1.31MB under bf16, and can be reduced to the 700KB level when using Q8_0 quantization U•ェ•*U

Edit: Now 1000K params chat model!

raincandy-u/TinyChat-1776K

3 replies

Niansuh

posted an update about 2 years ago

Post

1188

**Model Names:** gpt-4-turbo-preview, gpt-4-vision-preview, gpt-3.5-turbo-16k
**Searchable Models:** Creative, Balanced, Precise

Image creation will be available soon in NiansuhAI.
**Model Name:** DALL-E 3

https://huggingface.co/spaces/NiansuhAI/LLMs1
---