🧠 Introducing Qwen COG Thinker: a Cognitive Reasoning Mode for Qwen2.5
I fine-tuned Qwen2.5 with GRPO so it actually thinks before it answers, instead of just pattern-matching.
Most LLMs mimic reasoning. This one builds a real cognitive path:
📝 Plan → understand the task
🔍 Monitor → reason step by step
✅ Evaluate → verify before answering
Every response follows a strict structured protocol:
<think> <planning> ... </planning> <monitoring> ... </monitoring> <evaluation> ... </evaluation> </think>
Then a clean, reasoning-free <output>.
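To make the protocol concrete, here is what a response could look like. The question and the section contents are invented for illustration; only the tag layout comes from the post, and the closing tags on the inner sections are an assumption:

```xml
<think>
  <planning>The task asks for a simple sum; I will add the two numbers.</planning>
  <monitoring>2 + 2 = 4. No carries or edge cases involved.</monitoring>
  <evaluation>The result 4 is consistent with basic arithmetic; no revision needed.</evaluation>
</think>
<output>4</output>
```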
The model self-checks its own structure: if a section is missing or malformed, the response is invalid.
This isn't chain-of-thought slapped on top. The reasoning protocol is baked in via RL.
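A format check like this is typically wired into GRPO as a reward function. The sketch below is not the author's actual training code; it's a minimal illustration, assuming the tag set shown in the post (with paired closing tags) and the reward-function shape used by libraries such as TRL's GRPOTrainer, which expects one float per completion:

```python
import re

# Required section order, per the protocol described in the post.
# Closing tags for the inner sections are an assumption; the post
# only shows the sections elided as "..." inside <think> ... </think>.
PROTOCOL = re.compile(
    r"^\s*<think>\s*"
    r"<planning>.+?</planning>\s*"
    r"<monitoring>.+?</monitoring>\s*"
    r"<evaluation>.+?</evaluation>\s*"
    r"</think>\s*"
    r"<output>.+?</output>\s*$",
    re.DOTALL,  # section contents may span multiple lines
)

def protocol_valid(response: str) -> bool:
    """True iff the response contains every section, in order."""
    return PROTOCOL.match(response) is not None

def format_reward(completions, **kwargs):
    """Binary format reward: 1.0 for a well-formed response, 0.0 otherwise.
    Matches the (completions, **kwargs) -> list[float] shape that
    TRL-style GRPO trainers accept for reward functions."""
    return [1.0 if protocol_valid(c) else 0.0 for c in completions]
```

During GRPO training, a reward like this (usually combined with a task-correctness reward on the `<output>` section) is what makes the structure "baked in" rather than prompted.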
👇 Full README + inference code below 👇
alibidaran/Qwen_COG_Thinker_Merged
#AI #LLM #Qwen #ReasoningModels #GRPO #OpenSource