Apple Unveils MGIE, An AI Model That Edits Photos Based On Your Text Commands
Apple's MGIE is capable of handling a wide range of editing scenarios, such as colour adjustments, object manipulations and more.
Apple's image editing AI model dubbed MGIE (short for MLLM-Guided Image Editing) is making waves. While it is still in early development, the model offers a glimpse into the future of intuitive photo editing.
The skyrocketing popularity and adoption of generative AI technology have paved the way for the development of artistic powerhouses like Microsoft's Image Creator (formerly Bing Image Creator) and Midjourney. However, this magic has a frustrating limitation: there's no quick way to edit an already-generated image. However, Apple is trying to change that.
Google's experimental image generation tool ImageFX lets you quickly modify your AI-generated concepts by allowing you to alter your prompt using expressive chips. Similarly, Apple researchers have released a new AI image tool that lets you edit photos with just words.
Instead of clicking through the menus, you just tell the MGIE model what you want: resize, flip, crop and even add filters via text prompts. It is also worth noting that the MGIE model has earned recognition at a leading AI research conference: the International Conference on Learning Representations (ICLR) 2024.
In the research paper, which was accepted at ICLR 2024, the researchers used a photo of a pepperoni pizza and the prompt "make it more healthy" to implement changes to the photo. The model added vegetables to the pepperoni pizza.
How does MGIE work?
MGIE delivers improved instruction-based image editing with the help of powerful AI models called MLLMs, which are capable of processing both text and images. Despite their remarkable cross-modal understanding and visual-aware response generation capabilities, MLLMs haven't been widely applied to image editing tasks.
Notably, MGIE uses MLLMs for the image editing process in 2 ways. Apple's AI image editing model uses MLLMs to get expressive instructions from user input. These concise and clear instructions act as guides for the editing process.
Aside from this, the model uses MLLMs to understand what you want to change, then imagines how the photo would look afterwards. This imagination guides the tool to make the changes you want, pixel by pixel, to match your photo.
Designed to get better with every edit, MGIE masters understanding, imagining, and editing all at the same time.
Using MGIE
MGIE is currently available as an open-source project on GitHub, where you can find the code, data and pre-trained models. Moreover, there's a demo notebook that serves as a guide for using MGIE for various editing tasks.
Alternatively, you can try out MGIE online through a web demo hosted on Hugging Face Spaces. So, it is safe to say that MGIE is an easy-to-use and flexible-to-customise image editing tool from the Cupertino-based tech giant.
After replacing Samsung as the world's most popular phone maker last month, Apple is now showing interest in joining the AI arms race, with CEO Tim Cook hinting at generative AI features for upcoming iPhone and MacBook models.
© Copyright IBTimes 2024. All rights reserved.