Tony Kong's picture

Tony Kong

TonyK

·

ZLKong

AI & ML interests

None yet

Organizations

authored a paper 6 months ago

Democratizing AI scientists using ToolUniverse

Paper • 2509.23426 • Published Sep 27, 2025 • 40

authored 10 papers about 1 year ago

Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge

Paper • 2312.05693 • Published Dec 9, 2023 • 1

EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge

Paper • 2402.10787 • Published Feb 16, 2024

You Need Multiple Exiting: Dynamic Early Exiting for Accelerating Unified Vision Language Model

Paper • 2211.11152 • Published Nov 21, 2022

Exploring Token Pruning in Vision State Space Models

Paper • 2409.18962 • Published Sep 27, 2024

Search for Efficient Large Language Models

Paper • 2409.17372 • Published Sep 25, 2024

Fast and Memory-Efficient Video Diffusion Using Streamlined Inference

Paper • 2411.01171 • Published Nov 2, 2024

Rethinking Token Reduction for State Space Models

Paper • 2410.14725 • Published Oct 16, 2024

Fully Open Source Moxin-7B Technical Report

Paper • 2412.06845 • Published Dec 8, 2024 • 11

RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation

Paper • 2501.04315 • Published Jan 8, 2025

TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools

Paper • 2503.10970 • Published Mar 14, 2025 • 18