Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
reaperdoesntknow
/
DiscoverLM-70M
like
0
Text Generation
Transformers
TensorBoard
Safetensors
nohurry/Opus-4.6-Reasoning-3000x-filtered
openbmb/UltraData-Math
yahma/alpaca-cleaned
English
moa_metric
trl
sft
metric-attention
mixture-of-attentions
triangle-inequality
blackhole-rope
discrepancy-calculus
discover
convergentintel
License:
cc
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Deploy
Use this model
main
DiscoverLM-70M
281 MB
Ctrl+K
Ctrl+K
1 contributor
History:
19 commits
reaperdoesntknow
Update model card: added convergentintel tag
2192d73
verified
6 days ago
.gitattributes
Safe
1.52 kB
initial commit
29 days ago
README.md
Safe
14.2 kB
Update model card: added convergentintel tag
6 days ago
config.json
Safe
1.6 kB
Upload MoAMetricLM
29 days ago
events.out.tfevents.1772979692.a28ffe9e0143.11703 (1).0
Safe
201 kB
xet
Upload 2 files
29 days ago
events.out.tfevents.1772979692.a28ffe9e0143.11703 (2).0
Safe
201 kB
xet
Upload events.out.tfevents.1772979692.a28ffe9e0143.11703 (2).0
29 days ago
generation_config.json
Safe
204 Bytes
Upload MoAMetricLM
29 days ago
model.safetensors
Safe
277 MB
xet
Upload MoAMetricLM
29 days ago
tokenizer.json
Safe
3.38 MB
Upload tokenizer
29 days ago
tokenizer_config.json
Safe
349 Bytes
Upload tokenizer
29 days ago
trainer_state.json
Safe
148 kB
Rename trainer_state (2).json to trainer_state.json
29 days ago