生成モデルに関する講演

1 view

Skip to first unread message

Lin Gu

unread,

Apr 16, 2024, 12:37:10 AM4/16/24

to im...@imageforum.org

Image-MLの皆様

理研AIPの谷ともします。

４月２２日（月）13:00から、
カリフォルニア大学マーセド校（University of California, Merced）の
Ming-Hsuan Yang教授に以下の内容で講演をしていただきます。
Ming-Hsuan Yang教授は、
Googleのresearch scientistでもあり、
生成モデルに関する分野における著名な研究者です。

今回のセミナーはハイブリッド開催となり、使用言語は英語です。

講演タイトル: Recent results on learning with diffusion models
（オーガナイザーによる和訳：拡散モデルによる学習に関する最近の成果）

講演者: Prof. Ming-Hsuan Yang Professor at UC Merced and Google.

時間: ４月２２日（月） 13:00-14:00

実施形態:　ハイブリッド
オンライン：
https://c5dc59ed978213830355fc8978.doorkeeper.jp/events/171794

対面参加：
東京都目黒区駒場4丁目6番1号東京大学駒場リサーチキャンパス　先端研４号館４階４２４室

Abstract: Diffusion models have been successfully applied to text-to-image generation with state-of-the-art performance. In this talk, I will discuss how these models can be used for low-level vision tasks and 3D scenes. First, I will present our findings on exploiting features from diffusion models and transformers for zero-shot semantic correspondence and other applications. Next, I will describe how we exploit diffusion models as effective prior for dense prediction, such as surface normal, depth, and segmentation. I will then discuss how diffusion models can facilitate articulated 3D reconstruction, 3D scene generation, and novel view synthesis. When time allows, I will present other results on fine-grained text-to-image generation and pixel-wise visual grounding of large multimodal models.

講演概要（オーガナイザーによる和訳）：
拡散モデルは、最先端の性能でテキストから画像への生成に成功しています。この講演では、これらのモデルが低レベルのビジョンタスクや3Dシーンにどのように使用されるかについて議論します。まず、拡散モデルとトランスフォーマーからの特徴の利用によるゼロショットの意味的対応やその他のアプリケーションに関する発見を紹介します。次に、表面法線、深度、セグメンテーションなどの密な予測に対して拡散モデルを効果的な事前条件として利用する方法について説明します。その後、拡散モデルが関節3D再構成、3Dシーン生成、および新しい視点の合成にどのように役立つかについて議論します。時間が許す限り、細かいテキストから画像への生成や大規模な多モーダルモデルのピクセル単位の視覚的基準づけに関するその他の結果も紹介します。

Bio: Ming-Hsuan Yang is a Professor at UC Merced and a Research Scientist with Google. He received the Google Faculty Award in 2009 and CAREER Award from the National Science Foundation in 2012. Yang received paper awards at UIST 2017, CVPR 2018, ACCV 2018, and Longuet-Higgins Prize in CVPR 2023. He is an Associate Editor-in-Chief of PAMI, Editor-in-Chief of CVIU, and Associate Editor of IJCV. He was the Program Chair for ACCV 2014 and ICCV 2019 and Senior Area Chair/Area Chair for CVPR, ICCV, ECCV, NeurIPS, ICLR, ICML, IJCAI, and AAAI. Yang is a Fellow of the IEEE and ACM.

略歴（オーガナイザーによる和訳）：
Ming-Hsuan Yang氏はカリフォルニア大学マーセド校の教授であると同時に、Googleの研究者をされています。2009年にGoogle Faculty Awardを、2012年にはNational Science FoundationからCAREER Awardを受賞しました。Yang氏はUIST 2017、CVPR 2018、ACCV 2018で論文賞を受賞し、CVPR 2023のLonguet-Higgins Prizeも受賞しました。また、PAMIのAssociate Editor-in-Chief 、CVIUのEditor-in-Chief、およびIJCVのAssociate Editorもされています。ACCV 2014およびICCV 2019のプログラム委員長も務めており、CVPR、ICCV、ECCV、NeurIPS、ICLR、ICML、IJCAI、AAAIのシニアエリアチェアおよびエリアチェアもされています。またYang氏はIEEEとACMのフェローでもあります。

皆様のご参加をお待ちしております。

よろしくお願いいたします。

谷林

----------------------------------------------
Lin GU, Ph.D (谷林)
RIKEN AIP
Research Scientist
https://scholar.google.com/citations?user=gIEZe5IAAAAJ&hl=en
理化学研究所　革新知能統合研究センター(AIP)
医用機械知能チーム　研究員
----------------------------------------------

Reply all

Reply to author

Forward

0 new messages