Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Chunyuan Deng's picture
2 7 2

Chunyuan Deng

CharlesDDDD
·
https://charlesdddd.github.io/
  • ChunyuanDeng
  • CharlesDDDD

AI & ML interests

Architecheture, Interpretability.

Recent Activity

updated a model 1 day ago
CharlesDDDD/looped_600M_gdn_4to1
published a model 1 day ago
CharlesDDDD/looped_600M_gdn_4to1
updated a model 1 day ago
CharlesDDDD/looped_window_attnetion_1B_formal
View all activity

Organizations

Georgia Institute of Technology's profile picture Rice University's profile picture Chili lab @ Rice U's profile picture

CharlesDDDD 's collections 2

llama
Llama baseline checkpoints (0.6B, 1.3B)
  • CharlesDDDD/llama_0.6B_100bt_formal

    Updated Feb 4 • 2
  • CharlesDDDD/llama_1.3B_100bt_formal

    Updated Feb 4 • 2
looped_transformer
Looped transformer checkpoint collection
  • CharlesDDDD/looped_window_600M_full_512_256_128_weighted_loss

    Updated Feb 4
  • CharlesDDDD/looped_600M_bookend_2_looped4_window_128

    Updated Feb 4
  • CharlesDDDD/looped_600M_4to1_looped1_window_128

    Updated Feb 4
  • CharlesDDDD/looped_transformer_loop_count_4

    Updated Feb 5 • 1
llama
Llama baseline checkpoints (0.6B, 1.3B)
  • CharlesDDDD/llama_0.6B_100bt_formal

    Updated Feb 4 • 2
  • CharlesDDDD/llama_1.3B_100bt_formal

    Updated Feb 4 • 2
looped_transformer
Looped transformer checkpoint collection
  • CharlesDDDD/looped_window_600M_full_512_256_128_weighted_loss

    Updated Feb 4
  • CharlesDDDD/looped_600M_bookend_2_looped4_window_128

    Updated Feb 4
  • CharlesDDDD/looped_600M_4to1_looped1_window_128

    Updated Feb 4
  • CharlesDDDD/looped_transformer_loop_count_4

    Updated Feb 5 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs