OpenAI expands multimodal capabilities with updated text-to-video model

OpenAI has released a new version of its text-to-video AI model, Sora, for ChatGPT Plus and Pro users, marking another step in its expansion into multimodal AI technologies.

The original Sora model, introduced earlier this year, was restricted to safety testers in the research preview phase, limiting its availability.

The new Sora Turbo version offers significantly faster performance compared to its predecessor, OpenAI said in a blog post.

Sora is currently available to users across all regions where ChatGPT operates, except in the UK, Switzerland, and the European Economic Area, where OpenAI plans to expand access in the coming months.

ChatGPT, which gained global prominence in 2022, has been a driving force behind the widespread adoption of generative AI. Sora reflects OpenAI’s ongoing efforts to maintain a competitive edge in the rapidly evolving AI landscape.

Keeping pace with rivals

The move positions OpenAI to compete with similar offerings from rivals like Meta, Google, and Stability AI.

“The true power of GenAI will be in realizing its multi-modal capabilities,” said Sharath Srinivasamurthy, associate vice president at IDC. “Since OpenAI was lagging behind its competitors in text-to-video, this move was needed to stay relevant and compete.”

However, both Google and Meta beat OpenAI to making their video models publicly available for review, even though Sora was first previewed back in February.

“OpenAI likely anticipated becoming a target if it launched this service first, so it seems probable that they waited for other companies to release their video generation products while refining Sora for public preview or alpha testing,” said Hyoun Park, CEO and chief analyst at Amalgam Insights. “OpenAI is offering longer videos, whereas Google supports six-second videos and Meta supports 16-second videos.”

Integration remains a work in progress, though OpenAI is expected to eventually provide data integration for Sora comparable to its other models, Park added.

Managing regulatory concerns

Sora-generated videos will include C2PA metadata, enabling users to identify the content’s origin and verify its authenticity. This is significant amid global regulatory efforts to ensure AI firms adhere to compliance requirements.
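In practical terms, C2PA provenance data travels inside the media file itself: for MP4/ISO BMFF video, the C2PA specification places the manifest store in a top-level `uuid` box. As a rough illustration only (this is not OpenAI's tooling, and real verification requires a C2PA validator such as c2patool), the sketch below walks the top-level boxes of an MP4 byte stream, which is the first step toward spotting where such embedded provenance metadata would live:

```python
import struct

def list_top_level_boxes(data: bytes):
    """Return the types of top-level ISO BMFF (MP4) boxes in `data`.

    C2PA manifests in MP4 files are carried in a top-level 'uuid' box,
    so listing box types shows where provenance metadata would appear.
    Simplified sketch: it does not parse or validate manifest contents.
    """
    boxes = []
    offset = 0
    while offset + 8 <= len(data):
        # Each box starts with a 32-bit big-endian size and a 4-char type.
        size, = struct.unpack_from(">I", data, offset)
        box_type = data[offset + 4:offset + 8].decode("ascii", "replace")
        if size == 1:
            # Extended 64-bit size follows the type field.
            size, = struct.unpack_from(">Q", data, offset + 8)
        elif size == 0:
            # A size of 0 means the box extends to the end of the file.
            size = len(data) - offset
        if size < 8:
            break  # malformed box; stop scanning
        boxes.append(box_type)
        offset += size
    return boxes

# Example with a synthetic stream: an 'ftyp' box followed by a 'uuid' box.
ftyp = struct.pack(">I", 16) + b"ftyp" + b"isom" + b"\x00" * 4
uuid = struct.pack(">I", 24) + b"uuid" + b"\x00" * 16
print(list_top_level_boxes(ftyp + uuid))  # ['ftyp', 'uuid']
```

Finding a `uuid` box only indicates that vendor metadata is present; confirming it is a genuine, untampered C2PA manifest requires checking the box's UUID and cryptographic signatures, which dedicated C2PA libraries handle.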

“While imperfect, we’ve added safeguards like visible watermarks by default, and built an internal search tool that uses technical attributes of generations to help verify if content came from Sora,” OpenAI said in the post.

Even with such safeguards, the use of data in training AI models continues to spark debates over intellectual property rights. In August, a federal judge in California ruled that visual artists could proceed with certain copyright claims against AI companies like Stability AI.

“As with all of OpenAI’s generative tools, Sora faces challenges related to being trained on commercial data, which is often subject to copyright and, in some cases, patents,” Park said. “This could create opportunities for vendors like Anthropic and Cohere, which have been more focused on adhering to EU governance guidelines.”

Extensive testing is critical for video-based generative AI applications due to concerns such as the rise of deepfakes, which likely contributed to the time it took OpenAI to release the model, according to Srinivasamurthy.
