Full scope of the Special Issue:
- Generative models for media generation, acquisition, compression, and processing, e.g., VAE, GAN, Transformer, NeRF, Gaussian Splatting, Diffusion models, LLMs, etc.;
- The novel applications of classical multimedia signal processing in generative media computing for enhancing interpretability, e.g., Fourier, Wavelet, Cosine Transforms, etc;
- Novel visual computing and processing techniques for complexity reduction and quality enhancement in detection, restoration, and understanding, etc;
- Integration and synthesis of visual data with other modalities, e.g., audio, text;
- Multi- and cross-modal large vision/language models for enhancing computing and understanding, e.g., ViT, Clip, Llama, GPT4, Flan-T5, LLaVA, etc., and applications, e.g., image captioning, and HOI, etc;
- Cross-domain adaptation and transfer learning using generative models for resource efficiency;
- Novel model evaluation, benchmarking, and analysis using synthetic and realistic media data.
Guest Editors
Kejun Wu, Huazhong University of Science and Technology, China.
Lijuan Wang, Microsoft, Redmond, Washington, United States of America.
You Yang, Huazhong University of Science and Technology, China.
Xinchao Wang, National University of Singapore, Singapore.
Gang Yu, StepFun, Shanghai, China.
Junsong Yuan, University at Buffalo, State University of New York, United States of America.
Timeline:
Final Manuscript Submission Deadline 1 Jan. 2025
Editorial Acceptance Deadline 1 May 2025
Click here for the full Call for Papers and submission instructions.
Please select the article type of “VSI: Generative Media” when submitting your manuscript online via the Journal submission link.
