| 
    TinyChatEngine
    
   | 
 
Public Member Functions | |
| Fp32CLIPVisionTransformer (std::string param_path, const struct model_config config, bool is_vila) | |
| struct Fp32CLIPVisionTransformer_output | forward (const struct Fp32CLIPVisionTransformer_input &input, bool is_vila) | 
Public Attributes | |
| Embedding | embed_positions | 
| Conv2D | embed_patch | 
| LayerNorm | pre_layernorm | 
| Linear_FP | mm_proj_0 | 
| Linear_FP | mm_proj_2 | 
| int | voc_size | 
| int | embed_dim | 
| int | padding_idx | 
| int | hidden_dim | 
| int | num_heads | 
| int | image_size | 
| int | patch_size | 
| int | num_patches | 
| int | num_positions | 
| int | projection_dim | 
| int | mmproj_dim | 
| std::vector< Fp32CLIPEncoderLayer > | layers | 
| std::string | profile_name = "Fp32CLIPVisionTransformer" |