|
TinyChatEngine
|
Public Member Functions | |
| Int4OPTDecoderLayer (std::string param_path, const struct model_config config, int layer_idx) | |
| struct Int4OPTDecoderLayer_output | forward (const struct Int4OPTDecoderLayer_input &input) |
Public Attributes | |
| int | embed_dim |
| int | num_attention_heads |
| int | hidden_dim |
| int | layer_idx |
| LayerNorm | self_attn_layer_norm |
| LayerNorm | final_layer_norm |
| Linear_FP_int4 | fc1 |
| Linear_FP_int4 | fc2 |
| Int4OPTAttention | attn |
| std::string | profile_name = "Int4OPTDecoderLayer" |