| Int4llamaAttention_input (Matrix3D< float > hidden_states_, Matrix3D< float > attention_mask_, int layer_idx_) |
| Int4llamaAttention_input (Matrix3D< float > hidden_states_, Matrix3D< float > attention_mask_, Matrix3D< float > past_key_, Matrix3D< float > past_value_, bool has_past_key_value_, int layer_idx_) |
bool | has_past_key_value = false |
int | layer_idx |
Matrix3D< float > | hidden_states |
Matrix3D< float > | attention_mask |
Matrix3D< float > | past_key |
Matrix3D< float > | past_value |
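The listing above gives only signatures, so the following is a minimal sketch of how the two constructors might populate the members. It is not the project's implementation. The `Matrix3D` template here is a hypothetical stand-in (a non-owning pointer plus three dimensions), the constructor bodies are assumed to copy each argument into the member of the same name, and `example_usage` and the tensor shapes are invented for illustration.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-in for the project's Matrix3D<T>: a non-owning view
// over a flat buffer. The real class is not shown in this page.
template <typename T>
struct Matrix3D {
    T* m_data = nullptr;
    int m_dim_x = 0, m_dim_y = 0, m_dim_z = 0;
    Matrix3D() = default;
    Matrix3D(T* data, int dim_x, int dim_y, int dim_z)
        : m_data(data), m_dim_x(dim_x), m_dim_y(dim_y), m_dim_z(dim_z) {}
};

// Reconstructed from the member listing; constructor bodies are assumptions.
struct Int4llamaAttention_input {
    Matrix3D<float> hidden_states;
    Matrix3D<float> attention_mask;
    Matrix3D<float> past_key, past_value;
    bool has_past_key_value = false;
    int layer_idx;

    // No cached keys/values: has_past_key_value keeps its default of false.
    Int4llamaAttention_input(Matrix3D<float> hidden_states_, Matrix3D<float> attention_mask_,
                             int layer_idx_)
        : hidden_states(hidden_states_), attention_mask(attention_mask_), layer_idx(layer_idx_) {}

    // Cached keys/values supplied by the caller, along with the flag.
    Int4llamaAttention_input(Matrix3D<float> hidden_states_, Matrix3D<float> attention_mask_,
                             Matrix3D<float> past_key_, Matrix3D<float> past_value_,
                             bool has_past_key_value_, int layer_idx_)
        : hidden_states(hidden_states_), attention_mask(attention_mask_),
          past_key(past_key_), past_value(past_value_),
          has_past_key_value(has_past_key_value_), layer_idx(layer_idx_) {}
};

// Builds one input without a cache and one with a cache; shapes are arbitrary.
inline bool example_usage() {
    std::vector<float> hidden(1 * 2 * 4), mask(1 * 2 * 2), k(1 * 2 * 4), v(1 * 2 * 4);
    Matrix3D<float> hs(hidden.data(), 1, 2, 4);
    Matrix3D<float> am(mask.data(), 1, 2, 2);
    Matrix3D<float> pk(k.data(), 1, 2, 4);
    Matrix3D<float> pv(v.data(), 1, 2, 4);

    Int4llamaAttention_input no_cache(hs, am, /*layer_idx_=*/0);
    Int4llamaAttention_input with_cache(hs, am, pk, pv, /*has_past_key_value_=*/true, 0);
    return !no_cache.has_past_key_value && with_cache.has_past_key_value;
}
```

Under these assumptions, the three-argument constructor leaves `past_key` and `past_value` default-constructed, so a consumer would presumably check `has_past_key_value` before reading them.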
The documentation for this struct was generated from the following file: