We gratefully acknowledge support from
the Simons Foundation and member institutions.

Multimedia

Authors and titles for recent submissions

[ total of 26 entries: 1-25 | 26 ]
[ showing 25 entries per page: fewer | more | all ]

Tue, 17 May 2022

[1]  arXiv:2205.07752 (cross-list from cs.CV) [pdf, other]
Title: A Data Cube of Big Satellite Image Time-Series for Agriculture Monitoring
Comments: This work has been accepted for publication in IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Multimedia (cs.MM)
[2]  arXiv:2205.07721 (cross-list from cs.CV) [pdf, other]
Title: Towards Space-to-Ground Data Availability for Agriculture Monitoring
Comments: Has been accepted for publication in IEEE IVMSP 2022: this https URL Specifically in the special session "Multimodal Analysis, Fusion and Retrieval of satellite images": this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[3]  arXiv:2205.07611 (cross-list from cs.CV) [pdf, other]
Title: Noise-Tolerant Learning for Audio-Visual Action Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[4]  arXiv:2205.07100 (cross-list from cs.CL) [pdf, other]
Title: Multiformer: A Head-Configurable Transformer-Based Model for Direct Speech Translation
Comments: NAACL-SRW 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)

Fri, 13 May 2022

[5]  arXiv:2205.05880 [pdf, other]
Title: Deep Decomposition and Bilinear Pooling Network for Blind Night-Time Image Quality Evaluation
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[6]  arXiv:2205.05953 (cross-list from cs.HC) [pdf, other]
Title: Emerging Immersive Communication Systems: Overview, Taxonomy, and Good Practises for QoE Assessment
Subjects: Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[7]  arXiv:2205.05949 (cross-list from eess.AS) [pdf, other]
Title: Automated Audio Captioning: an Overview of Recent Progress and New Challenges
Comments: Submitted to EURASIP Journal on Audio Speech and Music Processing in April
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Sound (cs.SD)
[8]  arXiv:2205.05854 (cross-list from cs.CV) [pdf, other]
Title: Entity-aware and Motion-aware Transformers for Language-driven Action Localization in Videos
Authors: Shuo Yang, Xinxiao Wu
Comments: accepted by IJCAI-22, Codes are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[9]  arXiv:2205.05738 (cross-list from cs.CL) [pdf, other]
Title: DISARM: Detecting the Victims Targeted by Harmful Memes
Comments: Accepted at NAACL 2022 (Findings)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Multimedia (cs.MM)

Thu, 12 May 2022

[10]  arXiv:2205.05177 [pdf, other]
Title: ConfLab: A Rich Multimodal Multisensor Dataset of Free-Standing Social Interactions In-the-Wild
Subjects: Multimedia (cs.MM); Machine Learning (cs.LG)

Wed, 11 May 2022

[11]  arXiv:2205.04906 [pdf, other]
Title: Evaluating the Impact of Tiled User-Adaptive Real-Time Point Cloud Streaming on VR Remote Communication
Subjects: Multimedia (cs.MM)
[12]  arXiv:2205.05072 (cross-list from cs.CV) [pdf, other]
Title: Learning Visual Styles from Audio-Visual Associations
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[13]  arXiv:2205.05069 (cross-list from cs.CV) [pdf, other]
Title: Accelerating the Training of Video Super-Resolution Models
Comments: The code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[14]  arXiv:2205.04908 (cross-list from cs.CV) [pdf, other]
Title: Shadow-Aware Dynamic Convolution for Shadow Removal
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[15]  arXiv:2205.04749 (cross-list from cs.CV) [pdf, other]
Title: Spatio-Temporal Transformer for Dynamic Facial Expression Recognition in the Wild
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)

Tue, 10 May 2022 (showing first 10 of 11 entries)

[16]  arXiv:2205.03782 [pdf, ps, other]
Title: SSIM-Variation-Based Complexity Optimization for Versatile Video Coding
Subjects: Multimedia (cs.MM); Multiagent Systems (cs.MA)
[17]  arXiv:2205.03684 [pdf, other]
Title: Timestamp-independent Haptic-Visual Synchronization
Subjects: Multimedia (cs.MM)
[18]  arXiv:2205.03595 [pdf, ps, other]
Title: $λ$-domain VVC Rate Control Based on Game Theory
Subjects: Multimedia (cs.MM); Multiagent Systems (cs.MA)
[19]  arXiv:2205.04404 (cross-list from cs.CL) [pdf, other]
Title: [email protected]: A Comparative Analysis for Troll-Based Meme Classification
Comments: Accepted at DravidianLangTech-ACL2022 (Colocated with ACL-2022). disinformation, misinformation, factuality, harmfulness, fake news, propaganda, multimodality, text, images, videos, network structure, temporality
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Social and Information Networks (cs.SI)
[20]  arXiv:2205.04402 (cross-list from cs.CL) [pdf, other]
Title: Detecting the Role of an Entity in Harmful Memes: Techniques and Their Limitations
Comments: Accepted at CONSTRAINT 2022 (Colocated with ACL-2022), disinformation, misinformation, factuality, harmfulness, fake news, propaganda, multimodality, text, images, videos, network structure, temporality
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Social and Information Networks (cs.SI)
[21]  arXiv:2205.04264 (cross-list from cs.CV) [pdf, other]
Title: SwinIQA: Learned Swin Distance for Compressed Image Quality Assessment
Comments: CVPR2022 Workshop (CLIC) accepted
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[22]  arXiv:2205.04188 (cross-list from cs.CV) [pdf, other]
Title: Joint learning of object graph and relation graph for visual question answering
Comments: 6 pages, 4 figures, Accepted by ICME 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[23]  arXiv:2205.04029 (cross-list from cs.SD) [pdf, other]
Title: Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis
Comments: Interspeech submission
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[24]  arXiv:2205.03923 (cross-list from cs.CV) [pdf, other]
Title: Unsupervised Discovery and Composition of Object Light Fields
Comments: Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[25]  arXiv:2205.03534 (cross-list from cs.CL) [pdf, other]
Title: Attract me to Buy: Advertisement Copywriting Generation with Multimodal Multi-structured Information
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[ total of 26 entries: 1-25 | 26 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2205, contact, help  (Access key information)