ChatGPT isn’t the only cool AI tool made by OpenAI — check out Sora, DALL-E, and more


ChatGPT creator OpenAI has other AI tools including AI video generator Sora, Dall-E, and Whisper.OpenAI

  • OpenAI is the startup behind the viral AI chatbot ChatGPT, but the company offers other AI products.

  • DALL-E creates images based on detailed text descriptions and Sora creates videos.

  • Whisper is a speech recognition model capable of transcribing and translating audio from many languages.

ChatGPT quickly went viral after its release in November 2022.

The tool has sparked controversy and even started a race among big tech companies like Google and Meta to develop their own, more powerful AI tools. OpenAI now has a $13 billion partnership with Microsoft and the tech giant integrated GPT-4o into Copilot and the Azure AI cloud suite.

However, the startup behind it, OpenAI, also offers other AI products – and it recently made its Sora AI video generator available to users. Take a look at some of the startup’s other AI products.

Screenshot of DALL-E search for “astronaut fish swimming in an ocean in space, digital art”DALL·E/OpenAI

SLAB

Just months before ChatGPT launched, OpenAI removed the waitlist for its AI generative art generatorSLAB. It quickly grew to over 1.5 million daily users as of September 2022, according to the company. wrote in a blog post. The tool – which quickly creates imaginative and detailed illustrations via a text prompt – has sparked controversy among artists upon its release, who debated what DALL-E and other AI art generators like this could prove useful for people in creative jobs.

Since the launch of DALL-E, OpenAI has released DALL-E 2 and DALL-E 3. The latest upgrade, DALL-E 3, includes more nuances and details than previous versions, the company said.

THE AI art generator creates original images called “generations” from detailed text prompts entered by a person. You can write detailed prompts like the one above – “astronaut fish swimming in an ocean in space, digital art” – and specify an art style or even reference a specific artist like Vincent Van Gogh.

You can also edit “generations” with the tool using one of the credits the program gives you each month, and upload your own photos to create images.

DALL·E AI-generated image of a “Van Gogh-style painting of a Formula 1 car driving on Mars”DALL·E/OpenAI

Whisper

The whisper is a automatic speech recognition model which transcribes speech to text and can identify and translate multiple languages ​​into English. The template can also transcribe in multiple languages.

The system was trained on 680,000 hours of multi-lingual, multi-tasking supervised data collected from the Internet, according to OpenAI.

In examples on its product page, Whisper transcribes nearly 30 seconds of audio of rapidly spoken text, a clip of a K-pop song, an audio clip of spoken French, and an audio clip of a person speaking with a loud accent.

Whisper is now used in a number of industries, including healthcare. Recently, an Associated Press report revealed that the technology is prone to hallucinations that include comments about race and violent rhetoric, which could pose problems if used in a medical context.

Manuscript

The Codex is an AI system that translates natural language in code. OpenAI claims that Codex is “best performing” in Python, but is also proficient in more than a dozen coding languages ​​like JavaScript and Swift.

The model can interpret simple commands entered by a user. OpenAI claims that the Codex is a “general-purpose programming model”, meaning it can be used for “essentially any programming task”, although its results may vary. OpenAI said it has successfully used the Codex “for transpilation, code explanation, and code refactoring.”

OpenAI provides some examples of how the Codex works, including using the model to program a space-themed game and give a computer voice commands to edit a Word document.

Sora

OpenAI announced during its “Shipmas” livestream on December 9 that it would release its AI video generator Sora to the public after making it available to a limited group of artists and creators in February.

Sora can generate videos up to 20 seconds from written instructions. The tool can also complete a scene and extend existing videos by filling in missing frames.

The company presented the new product and its various features, including the Explore page, which is a feed of videos shared by the Sora community. It also featured various preset styles for videos, such as pastel symmetry, film noir, and balloon world.

The company said in a blog post that the product “might struggle to simulate the physics of a complex scene,” as well as depict events that occur over time. There may also be confusion between left and right, the company said.

If the tool has already made a strong impression some in Hollywood, The tool’s product designer said during the demo that Sora wasn’t going to create feature films with a single click. Instead, the employee said the tool was more of an “extension of the creator behind it.”

API Tools

OpenAI also has a set of tools for developers. Its flagship reasoning models include o1, o1-mini, and the soon-to-be-released o3 and 03-mini models. OpenAI also offers GPT models, including GPT-4o and GPT-4o mini. OpenAI offers Chat Completion API, Assistants API, Batch API, and Real-time API. Users can explore models and APIs in OpenAI’s Playground without writing code. According to the company’s website, three million developers build with its tools.

Read the original article on Business Insider

Leave a Reply

Your email address will not be published. Required fields are marked *