You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Infer cache/RoPE weight dtype from output weights (#146)
- Add `dtype` argument to `precompute_freqs_cis`
- Infer caches/RoPE weights `dtype` from output weight `dtype` in the `Transformer` constructor
This way one can change precision in one place in `generate.py` and it will be propagated throughout the model
0 commit comments