TinyChatEngine
Public Member Functions
Int4OPTDecoderLayer (std::string param_path, const struct model_config config, int layer_idx)
struct Int4OPTDecoderLayer_output | forward (const struct Int4OPTDecoderLayer_input &input)
Public Attributes
int | embed_dim |
int | num_attention_heads |
int | hidden_dim |
int | layer_idx |
LayerNorm | self_attn_layer_norm |
LayerNorm | final_layer_norm |
Linear_FP_int4 | fc1 |
Linear_FP_int4 | fc2 |
Int4OPTAttention | attn |
std::string | profile_name = "Int4OPTDecoderLayer" |
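The attribute names above (self_attn_layer_norm, attn, final_layer_norm, fc1, fc2) match the usual OPT decoder layer, in which a layer norm precedes each sub-block and a residual connection follows it. The body of forward() is not shown on this page, so the following is only a minimal float sketch of that data flow, not TinyChatEngine's implementation. Every type and function in it (Vec, toy_layer_norm, ToyLinear, ToyDecoderLayer, toy_attn) is a hypothetical stand-in: the attention block is replaced by an identity function, the int4 linear layers by plain float matrices without bias, and the ReLU activation and pre-layer-norm ordering are assumptions taken from the common OPT configuration.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

using Vec = std::vector<float>;

// Hypothetical stand-in for LayerNorm: zero mean, unit variance,
// with no learned scale or bias.
inline Vec toy_layer_norm(const Vec &x, float eps = 1e-5f) {
    assert(!x.empty());
    float mean = 0.0f;
    for (float v : x) mean += v;
    mean /= static_cast<float>(x.size());
    float var = 0.0f;
    for (float v : x) var += (v - mean) * (v - mean);
    var /= static_cast<float>(x.size());
    const float inv_std = 1.0f / std::sqrt(var + eps);
    Vec out(x.size());
    for (std::size_t i = 0; i < x.size(); ++i) out[i] = (x[i] - mean) * inv_std;
    return out;
}

// Hypothetical stand-in for Linear_FP_int4: a dense float matrix stored
// row-major as [out_dim][in_dim], without bias or quantization.
struct ToyLinear {
    std::size_t in_dim;
    std::size_t out_dim;
    Vec weight;

    Vec forward(const Vec &x) const {
        assert(x.size() == in_dim);
        assert(weight.size() == in_dim * out_dim);
        Vec y(out_dim, 0.0f);
        for (std::size_t o = 0; o < out_dim; ++o)
            for (std::size_t i = 0; i < in_dim; ++i)
                y[o] += weight[o * in_dim + i] * x[i];
        return y;
    }
};

// Toy decoder layer for a single token vector of length embed_dim.
struct ToyDecoderLayer {
    int embed_dim;
    int hidden_dim;
    ToyLinear fc1;  // embed_dim -> hidden_dim
    ToyLinear fc2;  // hidden_dim -> embed_dim

    // Assumption: identity in place of the real attention block.
    Vec toy_attn(const Vec &x) const { return x; }

    Vec forward(const Vec &hidden) const {
        assert(hidden.size() == static_cast<std::size_t>(embed_dim));
        // Attention sub-block: layer norm, attention, residual add.
        Vec h = toy_attn(toy_layer_norm(hidden));
        for (std::size_t i = 0; i < h.size(); ++i) h[i] += hidden[i];
        // Feed-forward sub-block: layer norm, fc1, ReLU, fc2, residual add.
        Vec ff = fc1.forward(toy_layer_norm(h));
        assert(ff.size() == static_cast<std::size_t>(hidden_dim));
        for (float &v : ff) v = v > 0.0f ? v : 0.0f;
        Vec out = fc2.forward(ff);
        for (std::size_t i = 0; i < out.size(); ++i) out[i] += h[i];
        return out;
    }
};
```

With fc2 set to all zeros, the feed-forward branch contributes nothing and the output reduces to the input plus its normalized copy, which makes the two residual paths easy to check by hand.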