😊 Bio
I am an AI Research Leader with 15+ years of experience advancing AI, from classical computer vision and machine learning to today's foundation and multimodal generative models.
I specialize in adaptive and collaborative multimodal learning and generation, with a forward-looking emphasis on: 1) specialization of large multimodal and diffusion-based models, 2) controllable multimodal generation and editing for data synthesis, 3) interaction-native modeling and learning, and 4) impact on real-world industrial applications.
My background integrates deep research roots with industrial execution:
- During nearly a decade at Amazon, I served as a Principal Scientist, leading high-impact core research and product efforts across Prime Video, Alexa, and mobile/.com shopping. I co-developed novel models and architectures for video understanding, vision-language representation, Large Multimodal Models, and diffusion models. My work powered AI-driven features such as live sports highlights, virtual try-on, interactive product recommendations, and shopping assistants, used by millions of users worldwide and generating O(XXM) USD in business impact.
- I spent the first part of my career in academia, obtaining my Ph.D. in Computer Science from the University of Verona (Italy) in 2012, supervised by Prof. Vittorio Murino and Prof. Marco Cristani. I was also a visiting student at the University of British Columbia with Prof. Nando de Freitas, and a postdoctoral fellow at Dartmouth College with Prof. Lorenzo Torresani and at the Italian Institute of Technology with Prof. Vittorio Murino.
Creativity fuels my work in both science and sound. A long-time keyboardist and former band member, I am currently composing and producing original music in my home studio. You can explore my latest tracks on SoundCloud.
📢 News
- Now: I am looking for my next role, in an environment where I can lead high-stakes innovation and drive the next generation of AI for industry.
- Mar 2, 2026: Invited guest lecture on Multimodal Intelligence at University of Utah. Thanks Ziad!
- Feb 21, 2026: 1 of 2 submitted papers accepted at CVPR 2026!
- Dec 5, 2025: Invited speaker at the University of Trento and FBK. Thanks Yiming!
- Nov 28, 2025: Invited speaker at the Turin AI Fall School 2025. Thanks Tatiana!
- Oct 27, 2025: Invited speaker at the Italian Institute of Technology (IIT). Thanks Vittorio!
📝 Research, Publications and Patents [Google Scholar]
Interactive Episodic Memory with User Feedback
N. Subedi, L. Bazzani, Z. Al-Halah
In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2026
Learning Visual Hierarchies in Hyperbolic Space for Image Retrieval
Z. Wang, S. Ramasinghe, C. Xu, J. Monteil, L. Bazzani, T. Ajanthan
In International Conference on Computer Vision (ICCV), 2025
UniCoRN: Unified Commented Retrieval Network with LMMs
M. Jaritz, M. Guillaumin, S. Sternig, L. Bazzani
arXiv, 2025
iEdit: Localised Text-guided Image Editing with Weak Supervision
R. Bodur, E. Gundogdu, B. Bhattarai, T-K Kim, M. Donoser, L. Bazzani
In Computer Vision and Pattern Recognition (CVPR) Workshops, 2024
Contrastive Language-Action Pre-training for Temporal Localization
M. Xu, E. Gundogdu, M. Lapin, B. Ghanem, M. Donoser, L. Bazzani
arXiv, 2022
Localized Triplet Loss for Fine-Grained Fashion Image Retrieval
A. D’Innocente, N. Garg, Y. Zhang, L. Bazzani, M. Donoser
In Computer Vision and Pattern Recognition (CVPR) Workshops, 2021
Learning Joint Visual Semantic Matching Embeddings for Language-guided Retrieval
Y. Chen, L. Bazzani
In European Conference on Computer Vision (ECCV), 2020
Group Detection and Tracking using Sociological Features
S. Vascon, L. Bazzani
In Group and Crowd Behavior for Computer Vision, 2017
Approximate Log-Hilbert-Schmidt distances between covariance operators for image classification
H. Q. Minh, M. San Biagio, L. Bazzani, V. Murino
In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016
Kernel Methods on Approximate Infinite-Dimensional Covariance Operators for Image Classification
H. Q. Minh, M. San Biagio, L. Bazzani, V. Murino
arXiv, 2016
Weighted bag of visual words for object recognition
L. Bazzani*, M. San Biagio*, M. Cristani, V. Murino
In IEEE International Conference on Image Processing (ICIP), 2014
Semi-supervised multi-feature learning for person re-identification
D. Figueira, L. Bazzani, H. Q. Minh, M. Cristani, A. Bernardino, V. Murino
In International Conference on Advanced Video and Signal-based Surveillance (AVSS), 2013
Person re-identification with a PTZ camera: an introductory study
P. Salvagnini, L. Bazzani, M. Cristani, V. Murino
In International Conference on Image Processing (ICIP), 2013
Online Bayesian non-parametrics for social group detection
M. Zanotto, L. Bazzani, M. Cristani, V. Murino
In British Machine Vision Conference (BMVC), 2012
Multiple-shot person re-identification by chromatic and epitomic analyses
L. Bazzani, M. Cristani, A. Perina, V. Murino
Pattern Recognition Letters (PRL), 2012
Towards computational proxemics: Inferring social relations from interpersonal distances
M. Cristani, G. Pagetti, A. Vinciarelli, L. Bazzani, G. Menegaz, V. Murino
In International Conference on Social Computing (SocialCom), 2011
Multiple-shot person re-identification by HPE signature
L. Bazzani, M. Cristani, A. Perina, M. Farenzena, V. Murino
In International Conference on Pattern Recognition (ICPR), 2010
Collaborative particle filters for group tracking
L. Bazzani, M. Cristani, V. Murino
In International Conference on Image Processing (ICIP), 2010