cafoscariNEWS

Agenda

13 Dec 2023 10:30

seminar/lecture

Diffusion Models for Image Editing and Novel View Synthesis

Room B, ZETA building - Campus Scientifico, via Torino

Speaker: Loris Bazzani, Amazon Research

Abstract:
In this talk, we present two lines of work that show how to adapt diffusion models to perform image editing and novel view synthesis. In the first part, we present a novel method for text-guided image editing, named iEdit, that generates images conditioned on a source image and a textual edit prompt. As a fully annotated dataset with target images does not exist, we propose to automatically construct a dataset derived from LAION-5B, containing pseudo-target images with their descriptive edit prompts given input image-caption pairs. This enables us to introduce a weakly-supervised loss to generate the pseudo-target image from the latent noise of the source image conditioned on the edit prompt. In the second part, we show how to generate novel views of an object by presenting a training-free algorithm, named ViewFusion, that can be integrated into existing pre-trained diffusion models. Our approach adopts an auto-regressive method that implicitly leverages previously generated views as context for next-view generation, ensuring robust multi-view consistency during the novel-view generation process. Through a diffusion process that fuses known-view information via interpolated denoising, our framework successfully extends single-view conditioned models to work in multiple-view conditional settings without any additional fine-tuning.

Bio Sketch:
Loris is a Principal Scientist at Amazon in Berlin. He has experience in prototyping and developing video understanding models for Amazon Video and multimodal product understanding models for recommendations, and in creating novel interactive shopping experiences. He obtained his Ph.D. in Computer Science from the University of Verona (Italy) in 2012 and visited the University of British Columbia, working on tracking, re-identification, and attentional models. Before his current position, Loris was a postdoctoral fellow at Dartmouth College and a postdoctoral fellow at the Italian Institute of Technology, working on object recognition, localization, and temporal saliency prediction for videos.

Language

The event will be held in English

Organizer

Department of Environmental Sciences, Informatics and Statistics - Sebastiano Vascon

