-
Notifications
You must be signed in to change notification settings - Fork 51
Issues: iree-org/iree-turbine
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Precondition
add_get_results
of expansion pass is not checked
#791
opened May 5, 2025 by
martin-luecke
Calling 2 consective get_kernels and then launching the kernels fails
#770
opened Apr 29, 2025 by
harsh-nod
IREE compilation flag IREE_ENABLE_RUNTIME_TRACING is breaking import iree.turbine.aot
#746
opened Apr 23, 2025 by
voonter
[export] Exporting MoE block leads to CONSTRAINT_VIOLATION error
#702
opened Apr 11, 2025 by
archana-ramalingam
test case failure with "AssertionError: Dynamic dims not currently supported"
#672
opened Apr 2, 2025 by
FlintWangacc
fail to compile deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B model
#659
opened Mar 30, 2025 by
FlintWangacc
Implement a reference wave.template.layers.attention.scaled_dot_product_attention function that smartly handle multiple batch dims (i.e BSD, BHSD, B_1 B H S D, etc), handling of dynamic dimensions, and using LRU cache inside to prevent recompile or even hashing all together.
#656
opened Mar 29, 2025 by
raikonenfnu
Refactor quantized attention kernel to have dynamic seqlen to prevent recompile for LLM
#655
opened Mar 29, 2025 by
raikonenfnu
Add support for GEMM fused with bias (currently stalled/broken because for single output reduction we propagate out the index of the whole reductionOp which consist of all the dimensions inside it as opposed to just the index layout of the yielded value mma in this case.)
#654
opened Mar 29, 2025 by
raikonenfnu
QOP per tensor scaling FP8 GEMM kernel (important for quantized linear layer)
#653
opened Mar 29, 2025 by
raikonenfnu
Previous Next
ProTip!
Updated in the last three days: updated:>2025-05-09.