A FuseFormer block is the building block of the FuseFormer model for video inpainting. It is identical to a standard Transformer block except that the feed-forward network is replaced with a Fusion Feed Forward Network (F3N). F3N adds no parameters to the standard feed-forward network; the only difference is that it inserts a soft-split and a soft-composite operation between the two MLP layers.
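To make the parameter-free nature of the two inserted operations concrete, here is a minimal pure-Python sketch on a 1D signal. It is an illustration under stated assumptions, not the authors' implementation: the real model works on 2D video-frame patches, and the names `soft_split`, `soft_composite`, `linear`, and `f3n_toy`, the ReLU activation and its placement after the fusion step, and the averaging of overlapping values are all hypothetical choices made for this example.

```python
def soft_split(signal, kernel, stride):
    """Cut a 1D signal into overlapping patches (no learnable weights)."""
    return [signal[i:i + kernel]
            for i in range(0, len(signal) - kernel + 1, stride)]


def soft_composite(patches, length, stride):
    """Overlap-add patches into one signal, averaging where patches overlap.

    Assumes the patches cover every position, i.e.
    (length - kernel) is a multiple of stride.
    """
    total = [0.0] * length
    count = [0] * length
    for p, patch in enumerate(patches):
        start = p * stride
        for j, value in enumerate(patch):
            total[start + j] += value
            count[start + j] += 1
    return [t / c for t, c in zip(total, count)]


def linear(x, weight, bias):
    """Plain dense layer: weight is a list of rows, one row per output unit."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weight, bias)]


def f3n_toy(tokens, w1, b1, w2, b2, length, kernel, stride):
    """Toy F3N: linear -> soft composite -> soft split -> ReLU -> linear.

    The hidden width of the first layer must equal `kernel`, so each
    hidden vector can be read as one patch of the signal.
    """
    hidden = [linear(t, w1, b1) for t in tokens]
    # Fusion step: neighbouring tokens exchange information through the
    # positions their patches share. No parameters are involved here.
    fused = soft_split(soft_composite(hidden, length, stride), kernel, stride)
    activated = [[max(0.0, v) for v in patch] for patch in fused]
    return [linear(p, w2, b2) for p in activated]


# Two tokens of width 4 that overlap by two positions on a length-6 signal.
identity = [[1.0 if r == c else 0.0 for c in range(4)] for r in range(4)]
zeros = [0.0] * 4
tokens = [[1.0, 2.0, 3.0, 4.0], [10.0, 20.0, 30.0, 40.0]]
print(f3n_toy(tokens, identity, zeros, identity, zeros,
              length=6, kernel=4, stride=2))
# [[1.0, 2.0, 6.5, 12.0], [6.5, 12.0, 30.0, 40.0]]
```

With identity weights the two linear layers do nothing, so the output shows the effect of the fusion step alone: the positions shared by both tokens are averaged and written back into each token, while the only learnable parameters remain `w1`, `b1`, `w2`, and `b2`.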
Source: FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting