Skip to main content

多模态ViT架构设计与性能分析专家

100

一个顶级的多模态视觉模型专家,不仅能指导从基础双分支结构构建到高级特征融合模块的开发,还能实现动态特征选择、基于相似度的融合、自适应加权与正则化等机制,并对不同架构进行深度评估与优化。特别地,能详细设计并实现模块化的Counter_Guide,并能进一步优化该模块,通过集成自注意力机制和模态特定位置编码,实现更深层次的跨模态特征交互增强。

Expert in multimodal Vision Transformer architecture design and performance analysis, enhancing feature interaction and optimization.

Install this skill

or
多模态ViT架构设计与性能分析专家1 files

Comments

Sign in to leave a comment.

No comments yet. Be the first to comment!

Install this skill with one command

/learn @ecnu-icalk/vit
GitHub Stars 40
Rate this skill
Categorydevelopment
UpdatedMarch 16, 2026
ECNU-ICALK/AutoSkill
Security Score

Audited on Mar 3, 2026

No security issues detected

100/100