AI This Week: Key Innovations in AI-Enhanced Creativity and Productivity

This week brought notable AI updates: significant changes to Microsoft 365’s Copilot, new editing features for OpenAI’s DALL-E, the unveiling of Apple’s ReaLM AI system, which promises to elevate Siri’s capabilities, and the introduction of Stability AI’s Stable Audio 2.0. Let’s delve into the specifics of what’s new and exciting.

Apple’s ReaLM AI Challenges GPT-4’s Dominance with Enhanced Siri Capabilities

Apple’s latest innovation in AI, the ReaLM system, is poised to redefine the capabilities of Siri. Apple’s researchers claim it can “substantially outperform” OpenAI’s GPT-4 at reference resolution, the task of working out what a user is referring to. This advancement could lead to Siri remembering conversation history, understanding content on your iPhone screen, and recognizing background music. ReaLM, which stands for Reference Resolution As Language Modeling, treats that task as a language modeling problem, potentially making Siri more intuitive and context-aware. As Apple gears up for major AI announcements at WWDC 2024, ReaLM’s benchmark results against GPT models point to a smarter Siri on the horizon.
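To make the idea of treating reference resolution as a language modeling problem more concrete, here is a toy sketch in Python. It is not Apple’s implementation: the `Entity` class, the `build_prompt` function, the labels, and the prompt wording are all hypothetical, chosen only to show how on-screen items and a user request might be flattened into plain text that a language model could read.

```python
from dataclasses import dataclass


@dataclass
class Entity:
    """A candidate item the user might be referring to (hypothetical structure)."""
    kind: str  # illustrative label such as "phone_number"
    text: str  # the text as it might appear on screen


def build_prompt(query: str, entities: list[Entity]) -> str:
    """Flatten candidate entities and the user request into one text prompt."""
    lines = ["Candidate entities:"]
    for index, entity in enumerate(entities, start=1):
        lines.append(f"{index}. [{entity.kind}] {entity.text}")
    lines.append(f"User request: {query}")
    lines.append("Which entity numbers does the request refer to?")
    return "\n".join(lines)


# Made-up example data: two items that could be visible on a phone screen.
entities = [
    Entity("business", "Main Street Pharmacy"),
    Entity("phone_number", "555-0100"),
]
prompt = build_prompt("Call that number", entities)
print(prompt)
```

In a setup like this, the model’s job would be to answer with the number of the matching entity; how Apple’s system actually encodes the screen and scores candidates is not covered here.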

Microsoft 365’s Copilot Supercharged with GPT-4 Turbo and Enhanced Image Capabilities

Microsoft has rolled out a major update for Microsoft 365’s Copilot, integrating OpenAI’s GPT-4 Turbo. The upgrade is a boon for business subscribers: it lifts limitations on chat interactions and enables in-depth analysis of documents, emails, and more, thanks to support for up to 300 pages of text in a single prompt.
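For a sense of where a figure like 300 pages can come from, here is a back-of-the-envelope estimate in Python. It assumes the number derives from the 128,000-token context window OpenAI announced for GPT-4 Turbo; the words-per-token and words-per-page values are rough assumptions for English text, not figures from Microsoft or OpenAI, and the function name is ours.

```python
# Rough estimate only; every constant below is an assumption.
CONTEXT_TOKENS = 128_000  # context window OpenAI announced for GPT-4 Turbo
WORDS_PER_TOKEN = 0.75    # common rule of thumb for English text
WORDS_PER_PAGE = 320      # assumed page density


def estimated_pages(tokens: int,
                    words_per_token: float = WORDS_PER_TOKEN,
                    words_per_page: float = WORDS_PER_PAGE) -> float:
    """Convert a token budget into an approximate page count."""
    return tokens * words_per_token / words_per_page


print(round(estimated_pages(CONTEXT_TOKENS)))  # prints 300
```

Denser pages or text that tokenizes less efficiently would lower the estimate, so the page count is best read as an approximation.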

Not stopping at text, Microsoft has also expanded image generation in Microsoft Designer for Copilot users. The daily limit rises from 15 to 100 images, powered by OpenAI’s DALL-E 3, which should make creating visual content quicker for businesses.

Enhancing Creativity: DALL-E Introduces New Editing Features

OpenAI’s DALL-E, known for transforming text prompts into vivid images, has just become even more powerful. Recognizing that the first draft isn’t always perfect, OpenAI has launched a new editor interface that lets users tweak their AI-generated images with ease.

Fine-Tuning Made Easy

With the new editing capabilities, users can now make precise adjustments to their DALL-E creations. Whether it’s adding a splash of flowers or changing the colour scheme to black and white, the process is straightforward and intuitive. The update introduces two main ways to edit:

1) Selective Editing: Users can highlight specific parts of an image and describe desired changes, such as “add flowers,” using a new selection tool. This tool is accessible on both the desktop and the ChatGPT mobile app, offering flexibility across devices.

2) Overall Changes: For broader modifications, users can input commands directly into the chat sidebar, such as “make it black and white,” making it simple to alter the entire image’s aesthetic.

Stability AI Elevates AI-Generated Music with Stable Audio 2.0

Stability AI has just rolled out an update to its text-to-music AI tool, Stable Audio. Stable Audio 2.0 introduces a suite of new features, offering users enhanced creative capabilities and longer track lengths.

What’s New?

Extended Play Time: Surpassing the previous 90-second limit, Stable Audio 2.0 enables the creation of tracks up to three minutes long, allowing for more complex and detailed musical pieces.
Audio-to-Audio Transformation: This innovative feature lets users upload existing audio samples and then modify them through textual prompts, opening up new possibilities for audio editing and creativity.
Advanced Sound Effects: The update brings improved sound effect generation, giving users more control and flexibility over the auditory elements of their creations.

Ethical Considerations and Accessibility

Stability AI has built Stable Audio 2.0 on a licensed dataset from AudioSparx, ensuring that artists and musicians are fairly compensated. The tool also respects opt-out requests, emphasizing ethical considerations in AI development. Moreover, while Stable Audio 2.0 offers a free tier for casual and amateur use, premium subscriptions are available for professionals seeking greater track generation allowances and access to creator licenses.

Keep ahead of the curve – join our community today!

Follow us for the latest discoveries, innovations, and discussions that shape the world of artificial intelligence.
