TinyChatEngine
Public Member Functions
| | Fp32OPTDecoderLayer (std::string param_path, const struct model_config config, int layer_idx) |
| struct Fp32OPTDecoderLayer_output | forward (const struct Fp32OPTDecoderLayer_input &input) |
Public Attributes
| int | embed_dim |
| int | num_attention_heads |
| int | hidden_dim |
| int | layer_idx |
| LayerNorm | self_attn_layer_norm |
| LayerNorm | final_layer_norm |
| Linear_FP | fc1 |
| Linear_FP | fc2 |
| Fp32OPTAttention | attn |
| std::string | profile_name = "Fp32OPTDecoderLayer" |
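The attribute list suggests the usual shape of a transformer decoder layer: a layer norm and attention block, followed by a second layer norm and a two-layer feed-forward network (fc1, fc2). The following is a minimal sketch of that data flow on a single token vector, not TinyChatEngine's actual implementation. The names `ToyLinear`, `ToyDecoderLayer`, and `layer_norm` are hypothetical stand-ins. The pre-layer-norm ordering and the ReLU activation are assumptions based on the common OPT configuration and are not confirmed by this page. The attention step is replaced by an identity placeholder, and the layer norm has no learned scale or bias.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

using Vec = std::vector<float>;

// Hypothetical stand-in for LayerNorm (no learned scale or bias).
Vec layer_norm(const Vec &x, float eps = 1e-5f) {
    float mean = 0.0f;
    for (float v : x) mean += v;
    mean /= static_cast<float>(x.size());
    float var = 0.0f;
    for (float v : x) var += (v - mean) * (v - mean);
    var /= static_cast<float>(x.size());
    Vec y(x.size());
    for (std::size_t i = 0; i < x.size(); ++i)
        y[i] = (x[i] - mean) / std::sqrt(var + eps);
    return y;
}

// Hypothetical stand-in for Linear_FP: y = W x, W row-major (out_dim x in_dim).
struct ToyLinear {
    int in_dim;
    int out_dim;
    Vec weight;

    Vec forward(const Vec &x) const {
        assert(x.size() == static_cast<std::size_t>(in_dim));
        Vec y(out_dim, 0.0f);
        for (int o = 0; o < out_dim; ++o)
            for (int i = 0; i < in_dim; ++i)
                y[o] += weight[o * in_dim + i] * x[i];
        return y;
    }
};

// Hypothetical decoder layer mirroring the attribute names listed above.
struct ToyDecoderLayer {
    int embed_dim;
    int hidden_dim;
    ToyLinear fc1;  // embed_dim -> hidden_dim
    ToyLinear fc2;  // hidden_dim -> embed_dim

    Vec forward(const Vec &x) const {
        // Attention sub-block (assumed pre-layer-norm ordering).
        Vec normed = layer_norm(x);        // role of self_attn_layer_norm
        Vec attn_out = normed;             // identity placeholder for attn
        Vec residual(x.size());
        for (std::size_t i = 0; i < x.size(); ++i)
            residual[i] = x[i] + attn_out[i];

        // Feed-forward sub-block.
        Vec h = fc1.forward(layer_norm(residual));  // role of final_layer_norm, then fc1
        for (float &v : h) v = std::max(v, 0.0f);   // ReLU (assumed activation)
        Vec out = fc2.forward(h);
        for (std::size_t i = 0; i < out.size(); ++i)
            out[i] += residual[i];
        return out;
    }
};
```

In the real class, the weights would presumably be loaded from `param_path` for the given `layer_idx`, and `forward` takes and returns the `Fp32OPTDecoderLayer_input` / `Fp32OPTDecoderLayer_output` structs, whose fields are not documented on this page.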