Microsoft has developed artificial intelligence capable of paying close attention to individual words when generating images from caption-like text descriptions. The technology, simply named The Drawing Bot, can generate imagery of everything from “ordinary pastoral scenes, such as grazing livestock, to the absurd, such as a floating double-decker bus.”
“If you go to Bing and you search for a bird, you get a bird picture. But here, the pictures are created by the computer, pixel by pixel, from scratch,” Xiaodong He, a principal researcher and research manager in the Deep Learning Technology Centre at Microsoft’s research lab in Washington, describes. “These birds may not exist in the real world – they are just an aspect of our computer’s imagination of birds.”
The Drawing Bot is the latest installation in a series of technological advancements made by Xiaodong and the rest of his team. Over the past 50 years, they have developed the CaptionBot which automatically writes photo captions as well as technology that answers questions humans ask about images, such as the location of an object within an image.
The development of image generation is particularly significant as it requires The Drawing Bot to imagine details that are not contained in the caption. “That means you need your machine learning algorithms running your artificial intelligence to imagine some missing parts of the images,” says Pengchuan Zhang, an associate researcher on the team.
Predicted uses for The Drawing Bot range from a sketch assistant for painters and interior designers, to a voice-activated tool for photo editing. With more computing power Xiaodong imagines the technology could generate animated films based on screenplays, automating and augmenting the work that animators do by removing some of the manual labour involved.
- David Lane talks us through his art direction for Robyn's newly released record
- Friday Mixtape: Vanessa Carlton and Godflesh combine thanks to The Beautiful Meme
- Jenny Jiao Hsia's game designs are as delightfully weird as they are weirdly delightful
- Luke Boland communicates industrialisation through his expansive photographs
- Okuyama Taiki became interested in design while running a free bookshop in Tokyo
- Congo Tales offers an alternative to fear-based environmental messaging
- This is an article about Wieden+Kennedy’s clever ad campaign - No B.S
- Combining thoughtful design and big business: an interview with Made Thought
- Iceland’s Christmas advert banned from broadcast for being too political
- The Saul Bass Archive looks back on the trailblazer’s rare poster design
- Typeface Pickle-Standard both obeys and rejects the grid at the same time
- Cornelius de Bill Baboul's latest project is "like Baudelaire in the age of McDonalds"