iPhone Designer & OpenAI Team Up π¨ AI-Generated Music Is Here π΅ Religion in Decline in the Face of AI? π
AI Can Copy Your Voice With a 15-Second Sample and Google Competes with SORA
AI is changing everything. Don't get left behind. Each Thursday get the latest headlines and keep abreast of whatβs happening.
AI won't take your job, It is somebody using AI that will.
Designer of the iPhone & Sam Altman Seek Funding for Secretive AI Device
Famed ex-Apple designer Jony Ive and OpenAI CEO Sam Altman are reportedly in talks with venture capitalists to secure $1 billion in funding to develop a brand-new AI-powered device.
The nature of the device is unknown, but it won't be a smartphone. Initial discussions reportedly included SoftBank CEO Masayoshi Son.
Why it matters: This collaboration signals a potential move by major tech figures to create a revolutionary AI hardware offering.
With Ive's iconic design sense and Altman's AI expertise, this device could shake up the tech landscape.
The venture raises questions about the future of smart devices and how we interact with complex AI.
Suno AI Democratises Music Creation - Partners with Microsoft
The AI-powered music tool Suno AI allows anyone to create full songs in seconds. A recent partnership with Microsoft integrates Suno AI into the Copilot platform.
Creators retain ownership rights with paid plans.
Suno AI emphasises ethical music creation and avoids copyright infringement risks.
Why it matters: Suno AI represents a significant breakthrough in AI music generation. Its ease of use and Microsoft integration put powerful song creation tools directly into consumers' hands.
The emphasis on legal and ethical practices sets Suno AI apart in this rapidly developing field, highlighting the importance of responsible AI use.
π Try Suno AI to create a song for free
π₯ Suno AI hype video
OpenAIβs Voice Engine Mimic Your Voice With a 15 Second Sample
OpenAI introduces Voice Engine, a new AI model that can generate realistic, unique voices based on just a 15-second audio sample, the model can also replicate your voice but in other languages.
This technology promises exciting applications but raises ethical concerns about potential misuse.
Why it matters:
Accessibility Revolution: Voice Engine could reshape communication, aiding nonverbal individuals, enhancing education, and offering reading assistance.
Global Content Creation: Companies could easily translate content, preserving the original speakerβs accent, expanding their reach.
Ethical Dilemmas: Real-sounding synthetic voices raise the potential for deepfakes and harmful impersonations. OpenAI emphasises consent and transparency.
π° OpenAI blog post with examples of the model
Google Releases a SORA Competitor
Google's newly unveiled Lumiere AI is a step forward for video generation.
Unlike existing AI models that struggle with temporal consistency, Lumiere directly generates full-length videos through its innovative Space-Time U-Net (STUNet) architecture.
This unique approach enables smoother, more complex motion sequences.
Why it matters: A major issue with existing videoGen AI is that tooling for video editing is all done in post production (in a application like After Effects), Lumiere comes with a suite of editing capabilities.
Text-to-video: Lumiere works alongside Googleβs existing text-to-image models for multi-modal compatability.
Stylised Video Generation: The model can take the style from an image to create videos in various styles.
Video Stylisation: Lumiere takes text prompts to alter the video stylisation.
Cinemagraph: Users can select a portion of the video to animate with a text prompt.
Inpainting & Outpainting: Videos can be extended or have sections altered with inpainting or outpainting.
π° Lumiere research paper with examples
π₯ Lumiere hype video
Study Links Rise of AI and Robotics to Decline in Religious Belief
New research suggests that exposure to advanced automation technologies like AI and robotics may weaken religious faith.
The study finds a correlation between the adoption of these technologies and a decline in religious affiliation across countries and individuals.
Researchers theorise that AI's ability to solve complex problems may reduce reliance on traditional religious explanations.
Why it matters: This study highlights a potential shift in how people understand the world around them.
As AI and robotics become more sophisticated, they may challenge traditional sources of comfort and moral guidance.
This research raises important questions about the evolving relationship between technology, spirituality, and how we find meaning.
π° Article on religious decline (interactive graph inside)
Apple and Google Race to On-Device AI
Apple and Google reveal new on device language models, ReALM and ScreenAI, designed to enhance voice assistants and on-device intelligence.
These models demonstrate capabilities like understanding screen content, providing personalised experiences, and enabling more intuitive device interactions.
Why it matters: ReALM and ScreenAI promise a future where our devices understand us better than ever. This opens doors for streamlined interactions, accessibility enhancements, and intelligent assistants that learn our preferences and anticipate our needs.
With a focus on on-device processing, these models enhance privacy and reduce reliance on cloud-based AI.
Apple's ReALM
Aims to enhance Siri, potentially leading to a more conversational and helpful "Siri 2.0" in future iOS updates.
Specialises in resolving confusing references in conversations (e.g., "that one")
Leverages on-device processing for efficiency, privacy, and a smooth user experience.
Google's ScreenAI
Designed to understand user interfaces and infographics.
Can answer questions about what's on a screen, navigate UIs, and summarise screen content.
Will lead to more intuitive and accessible devices across platforms.
π° Apple's ReALM
π Google's ScreenAI research paper
AI Market Reaches Fever Pitch
The AI industry is in a whirlwind of activity. Generative AI's popularity has soared, while the once-popular Modern Data Stack faces challenges.
New AI infrastructure companies are emerging, and consolidation in the data analytics field is expected.
Why it matters: The AI landscape is shifting rapidly. The explosive growth of Generative AI promises innovation but also raises questions about hype, sustainability, and the potential for a slowdown.
Companies must adapt to capitalise on the opportunities and navigate the potential hurdles.
Challenges and Considerations: Big tech is heavily invested (Microsoft, Amazon, Google) with each having strengths and weaknesses. Ethical concerns around fairness, bias, and responsible deployment are growing.
Venture funding may become more selective, impacting startups but potentially leading to a focus on sustainable innovation.
π° Interactive map of AI companies and infrastructure
π State of AI report
Googleβs VLOGGER: AI Generates Realistic Talking Head Videos with Full Body Motion
A new Google research paper introduces VLOGGER, an AI system capable of creating photorealistic videos of a person talking and moving naturally, using only a single reference image and audio input.
VLOGGER surpasses existing methods by generating the entire person β not just the face β and doesn't require any manual editing or face detection.
Why it matters:
Immersive Media: VLOGGER propels us towards truly immersive video experiences. From personalised content creation to hyper-realistic video editing, the possibilities are vast.
Accessibility: This could upend video translation and dubbing, making it easier to consume content across languages without relying on voiceover artists or complex editing.
Ethical Considerations: As with any AI that generates human likenesses, responsible use is key to avoid deepfakes and misinformation.