|
| Int4llamaAttention_input (Matrix3D< float > hidden_states_, Matrix3D< float > attention_mask_, int layer_idx_) |
|
| Int4llamaAttention_input (Matrix3D< float > hidden_states_, Matrix3D< float > attention_mask_, Matrix3D< float > past_key_, Matrix3D< float > past_value_, bool has_past_key_value_, int layer_idx_) |
|
|
bool | has_past_key_value = false |
|
int | layer_idx |
|
Matrix3D< float > | hidden_states |
|
Matrix3D< float > | attention_mask |
|
Matrix3D< float > | past_key |
|
Matrix3D< float > | past_value |
|
The documentation for this struct was generated from the following file: