(Note: * indicates equal contribution)
TANGO: Training-free Embodied AI Agents for Open-world Tasks
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
Following the Human Thread in Social Navigation
International Conference on Learning Representations (ICLR), 2025 [Spotlight]
Towards Polyp Counting in Full-Procedure Colonoscopy Videos
IEEE International Symposium on Biomedical Imaging (ISBI), 2025
Distilling Knowledge for Short-to-Long Term Trajectory Prediction
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
Harlequin: Color-driven Generation of Synthetic Data for Referring Expression Comprehension
International Conference on Pattern Recognition (ICPR), 2024
CVPR Workshop on Synthetic Data for Computer Vision (SynData4CV), 2024 [Extended Abstract]
Multi-Modal Transformer with Language Modality Distillation for Early Pedestrian Action Anticipation
Computer Vision and Image Understanding, Vol. 249, 2024
IEEE Transactions on Intelligent Transportation Systems (T-ITS), Vol. 25, Issue 12, pp. 20547-20560, 2024
IEEE Intelligent Transportation Systems Magazine, in press, 2024
International Conference on Pattern Recognition (ICPR), 2024
Exploiting Proximity-Aware Tasks for Embodied Social Navigation
IEEE/CVF International Conference on Computer Vision (ICCV), 2023
CVPR Embodied AI Workshop, 2023 [Extended Abstract]
Weakly-Supervised Visual-Textual Grounding with Semantic Prior Refinement
British Machine Vision Conference (BMVC), 2023
TAMFormer: Multi-Modal Transformer with Learned Attention Mask for Early Intent Prediction
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023 [Oral]
Knowledge-Based Systems, Vol. 275, Article 110699, 2023
Deep Symbolic Learning: Discovering Symbols and Rules from Perceptions
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Empowering Convolutional Neural Nets with MetaSin Activation
International Conference on Neural Information Processing Systems (NeurIPS), 2023
Online Learning of Reusable Abstract Models for Object Goal Navigation
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022
How many Observations are Enough? Knowledge Distillation for Trajectory Forecasting
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022
Goal-driven Self-Attentive Recurrent Networks for Trajectory Prediction
CVPR Precognition Workshop, 2022
Where are my Neighbors? Exploiting Patches Relations in Self-Supervised Vision Transformer
British Machine Vision Conference (BMVC), 2022 [Spotlight]
CVPR Workshop on Transformers for Vision (T4V), 2022 [Extended Abstract]
Early Pedestrian Intent Prediction via Features Estimation
IEEE International Conference on Image Processing (ICIP), 2022
Vision, Special Issue "Symposium on Perception and Cognition and Kanizsa Lecture", Vol. 6, No. 2, Article 29, pp. 1-19, 2022
Aligning and linking entity mentions in image, text, and knowledge base
Data & Knowledge Engineering, Vol. 138, 2022
Conditional Variational Capsule Network for Open Set Recognition
IEEE/CVF International Conference on Computer Vision (ICCV), 2021
SlowFast Rolling-Unrolling LSTMs for Action Anticipation in Egocentric Videos
ICCV Workshop on Egocentric Perception, Interaction and Computing (EPIC), 2021
AC-VRNN: Attentive Conditional-VRNN for Multi-Future Trajectory Prediction
Computer Vision and Image Understanding, Vol. 210, 2021
Am I Done? Predicting Action Progress in Videos
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Vol. 16, Issue 4, pp. 1-24, 2021
Improved Robustness to Disfluencies in RNN-Transducer Based Speech Recognition
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021
Exploiting Scene-specific Features for Object Goal Navigation
ECCV Workshop on Assistive Computer Vision and Robotics (ACVR), 2020
Knowledge Distillation for Action Anticipation via Label Smoothing
International Conference on Pattern Recognition (ICPR), 2020
A CNN-RNN Framework for Image Annotation from Visual Cues and Social Network Metadata
International Conference on Pattern Recognition (ICPR), 2020
On Visual-Textual-Knowledge Entity Linking
European Conference on Artificial Intelligence (ECAI), 2020 [Highlight]
IEEE International Conference on Semantic Computing (ICSC), 2020
VTKEL: A resource for Visual-Textual-Knowledge Entity Linking
ACM Symposium on Applied Computing (ACM-SAC), 2020
Social and Scene-aware Trajectory Prediction in Crowded Spaces
ICCV Workshop on Assistive Computer Vision and Robotics (ACVR), 2019
Long-term Path Prediction in Urban Scenarios using Circular Distributions
Image and Vision Computing, Vol. 69, pp. 81-91, 2018
Context-aware Trajectory Prediction
International Conference on Pattern Recognition (ICPR), 2018
Learning without Prejudice: Avoiding Bias in Webly-Supervised Action Recognition
Computer Vision and Image Understanding, Vol. 173, pp. 24-32, 2018
Human Action Anticipation: Deep Learning Approaches Across Diverse Domains
University of Padova, PhD Thesis, April 2024
Prediction of Activities and Visual Concepts Under Complex and Changing Conditions
University of Padova, PhD Thesis, February 2023
Object and event recognition in multimedia archives using local visual features
University of Florence, PhD Thesis, April 2011