Copyright
©The Author(s) 2025.
World J Gastroenterol. May 21, 2025; 31(19): 104897
Published online May 21, 2025. doi: 10.3748/wjg.v31.i19.104897
Published online May 21, 2025. doi: 10.3748/wjg.v31.i19.104897
Figure 5 Wavelets module of the Wave-Vision Transformer model.
This figure visually illustrates the Wavelets module’s structure and functionality within the Wave-Vision Transformer model. The module is designed to leverage wavelet transform for multi-scale feature extraction while preserving information integrity. By integrating wavelet transform with self-attention mechanisms, Wave-Vision Transformer achieves lossless downsampling, effectively retaining high-frequency information such as textures and edges, which enhances the model’s sensitivity to fine details. DWT: Discrete wavelet transform.
- Citation: Wei W, Zhang XL, Wang HZ, Wang LL, Wen JL, Han X, Liu Q. Application of deep learning models in the pathological classification and staging of esophageal cancer: A focus on Wave-Vision Transformer. World J Gastroenterol 2025; 31(19): 104897
- URL: https://www.wjgnet.com/1007-9327/full/v31/i19/104897.htm
- DOI: https://dx.doi.org/10.3748/wjg.v31.i19.104897