k04_fmha_prefill: GEAK task layout
This is a copy of k04_fmha_prefill/ placed at avo_data_examples/k04_fmha_prefill_geak/. It uses the same top-level GEAK layout as paged_attention_large.
Changes from the original tree
- Vendored _bench_common and patched the kernel/test_harness.py imports (sys.path.append now points to the task's scripts/ directory).
- kernel/.git omitted from the copy (cleaner worktree for AVO).
- kernel/.rocprofv3/ omitted.
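The import patch described above can be sketched roughly as follows. This is a minimal illustration, not the actual contents of kernel/test_harness.py: the helper name add_scripts_to_path is hypothetical, and it assumes that scripts/ sits next to kernel/ at the task root.

```python
import sys
from pathlib import Path


def add_scripts_to_path(harness_file: str) -> str:
    """Append <task root>/scripts to sys.path and return that directory.

    Assumption: the harness lives in <task root>/kernel/, so the task root
    is the parent of the harness's own directory.
    """
    task_root = Path(harness_file).resolve().parent.parent
    scripts_dir = str(task_root / "scripts")
    if scripts_dir not in sys.path:  # avoid duplicate entries on re-import
        sys.path.append(scripts_dir)
    return scripts_dir
```

Inside a harness this would typically be called as add_scripts_to_path(__file__) before importing any vendored module. Because the entry is appended rather than inserted at the front, modules already importable under the same name keep precedence.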
Dependencies
- PyTorch (ROCm)
- aiter (golden flash_attn_func reference in the harness)
- Triton
Commands
cd avo_data_examples/k04_fmha_prefill_geak
python3 scripts/task_runner.py compile
python3 scripts/task_runner.py correctness
python3 scripts/task_runner.py performance
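When the three stages are run as a single step, a small driver that stops at the first failure can be convenient. The sketch below is an assumption-laden illustration and not part of the task tree: run_stages and stage_command are hypothetical names, it relies only on the three subcommands listed above, and it uses the current interpreter (sys.executable) in place of python3.

```python
import subprocess
import sys
from typing import Callable, List, Optional, Sequence

# Stage names taken from the commands listed above, in the order they are run.
STAGES = ("compile", "correctness", "performance")


def stage_command(stage: str) -> List[str]:
    """Build the argv for one task_runner.py stage (path relative to the task root)."""
    return [sys.executable, "scripts/task_runner.py", stage]


def _default_run(cmd: Sequence[str]) -> int:
    """Run a command and return its exit code."""
    return subprocess.run(list(cmd)).returncode


def run_stages(run: Callable[[Sequence[str]], int] = _default_run) -> Optional[str]:
    """Run every stage in order; return the first failing stage, or None if all pass."""
    for stage in STAGES:
        if run(stage_command(stage)) != 0:
            return stage
    return None
```

Calling run_stages() from the task directory returns None when all three stages exit with status 0. The run parameter exists so the sequencing logic can be exercised without a GPU by passing a stub.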
geak-avo
export GEAK_SUBAGENTS_ROOT=/mnt/raid0/models/avo/avo_workspace/GEAK/subagents/preprocess
geak-avo --repo avo_data_examples/k04_fmha_prefill_geak \
--task "Optimize MLA FMHA prefill _attn_fwd. Metric: latency (lower is better). kernel/kernel_jit.py; optional host.py launch tuning." \
--test-command "python3 scripts/task_runner.py correctness && python3 scripts/task_runner.py performance" \
--mode full --gpu-ids 0 --no-rag
Full PerfSkills brief: ORIGINAL_TASK.md.