ChatGPT Goes Visual: Microsoft's New Model Incorporates Visual Foundation Models

Microsoft has introduced a new model called “Visual ChatGPT” that allows users to interact with ChatGPT using both text and images. The system combines different types of Visual Foundation Models, such as Transformers, ControlNet, and Stable Diffusion, with ChatGPT to enable the sending and receiving of images during chats, as well as injecting visual prompts for editing images.

According to the paper titled “Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models,” the visual transformer models and ChatGPT are experts of specific tasks with fixed inputs and outputs. However, combining them makes image generation and manipulation limitless. To bridge the gap between ChatGPT and VFMs, the paper proposes the use of a Prompt Manager with features such as informing ChatGPT about each VFM’s capabilities, converting visual information into language format, and managing the histories, priorities, and conflicts of different VFMs.

With the Prompt Manager, ChatGPT can leverage VFMs and receive their feedback in an iterative manner until the users’ requirements are met. Users can interact with ChatGPT using images and ask for complex image questions or visual editing by collaborating with different AI models in multi-steps. They can also ask for corrections and feedback on results. The GitHub repository provides more information on the new model.

TechMorung is proudly hosted on A2Hosting

Popular Post

10 Reasons why A2hosting is the perfect WordPress hosting in India

10 websites to help you monetize your Instagram account

Tired of Windows? Here are 4 Windows OS alternative you can use

5 Best Free WordPress SEO Plugin to rank your blog in 2020

ChatGPT Goes Visual: Microsoft’s New Model Incorporates Visual Foundation Models

Leave a Reply Cancel reply

Stay Connected

Categories

News

Must Read

10 Reasons why A2hosting is the perfect WordPress hosting in India

10 websites to help you monetize your Instagram account

Tired of Windows? Here are 4 Windows OS alternative you can use

5 Best Free WordPress SEO Plugin to rank your blog in 2020

Create an Amazing Tech News Website

You Might also Like

9 SEO Blogs You Need To Follow Right Away To Master SEO

How to install & execute BeEF hacking tool

Highest-Paid YouTubers of 2020

Microsoft Rewards: Maximizing Your Earnings While Searching the Web