Feature Request: Granite 4 Support #13275
Labels: enhancement (New feature or request)
Comments
For reference, support PRs in other platforms:
If this is the same idea as with llama 4, then I think we already support this. In short, it's just the code in this snippet:
[Embedded snippet: Lines 4536 to 4547 in 3bf785f]
@ngxson That's great! Thanks for pointing that out
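For context, the "same idea" here is presumably the shared (always-active) expert that Llama 4 uses alongside its routed experts, which is also what `GraniteMoEShared` (see the feature description below) adds. A minimal sketch of that pattern, using hypothetical stand-in names rather than the code at the cited lines:

```cpp
// Hedged illustration only: the shared-expert MoE pattern that Llama 4 and
// GraniteMoEShared both use. All names here are hypothetical stand-ins, not
// the llama.cpp code at the lines cited above.
#include <cstddef>
#include <utility>
#include <vector>

using Vec = std::vector<float>;

// Stand-in for an expert FFN forward pass (identity here, just for shape).
static Vec expert_forward(int /*expert_id*/, const Vec & x) { return x; }

// MoE output = weighted sum of the top-k routed experts, plus a shared expert
// that is applied to every token with no routing (some variants also gate it).
static Vec moe_with_shared_expert(const Vec & x,
                                  const std::vector<std::pair<int, float>> & topk) {
    Vec out(x.size(), 0.0f);
    for (const auto & [expert_id, gate] : topk) {
        const Vec y = expert_forward(expert_id, x);
        for (size_t i = 0; i < x.size(); ++i) { out[i] += gate * y[i]; }
    }
    const Vec shared = expert_forward(/*expert_id=*/-1, x);  // always active
    for (size_t i = 0; i < x.size(); ++i) { out[i] += shared[i]; }
    return out;
}
```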
Prerequisites
Feature Description
This issue is to track work to support IBM's Granite 4 model architecture (`GraniteMoEHybrid` in `transformers`). The model uses a number of components that are not yet supported in `llama.cpp`, but are being worked on independently, so I'm raising this issue to triangulate the different work streams that will be needed to support the model.
Necessary Components
- `jamba` by @compilade: llama : support Jamba hybrid Transformer-Mamba models #7531
- `bamba`: Bamba architecture #10810
- A `bamba` branch that's also out-of-date: https://github.com/gabe-l-hart/llama.cpp/tree/BambaArchitectureRefactor
- `GraniteMoEShared` layers: Model: Granite MoE shared #13269
- `mamba2` in non-CPU backends
  - The `metal` backend needs look like they're already addressed in llama : initial Mamba-2 support #9126, but for me that still doesn't work on my M3 (assertion error about non-contiguous data).
- `GraniteMoEHybrid` support tying all of the other pieces together (a rough sketch of the layer dispatch follows this list)
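To make that last item a bit more concrete: `GraniteMoEHybrid` interleaves mamba2 (SSM) blocks with attention blocks and uses MoE feed-forward layers with a shared expert, so the model graph needs a per-layer choice between the two mixer types. A rough sketch of that dispatch, using hypothetical names rather than actual llama.cpp structures:

```cpp
// Hedged sketch only: the per-layer dispatch a hybrid SSM/attention graph
// needs. Types and names are hypothetical, not actual llama.cpp structures;
// residual connections and norms are omitted for brevity.
#include <vector>

using Vec = std::vector<float>;

enum class BlockType { MAMBA2, ATTENTION };

struct HybridLayer {
    BlockType type;  // which sequence mixer this layer uses
    // per-layer weights would live here
};

// Stand-ins for the real blocks (identity here, just to show the control flow).
static Vec mamba2_block   (const HybridLayer &, const Vec & x) { return x; }
static Vec attention_block(const HybridLayer &, const Vec & x) { return x; }
static Vec moe_ffn        (const HybridLayer &, const Vec & x) { return x; }  // routed + shared experts

// Every layer picks its mixer by block type, then applies the MoE FFN.
static Vec forward(const std::vector<HybridLayer> & layers, Vec x) {
    for (const HybridLayer & layer : layers) {
        x = (layer.type == BlockType::MAMBA2) ? mamba2_block(layer, x)
                                              : attention_block(layer, x);
        x = moe_ffn(layer, x);
    }
    return x;
}
```

Presumably the per-layer block type would be recorded in the GGUF metadata at conversion time, but that plumbing is exactly what this issue is meant to coordinate.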
Motivation
I lead IBM's efforts to ensure that Granite models work everywhere, and `llama.cpp` is a critical part of "everywhere!"
Possible Implementation
No response