Gemini Omni

185 points by meetpateltech 4 hours ago on hackernews | 85 comments

Gemini Omni is where Gemini’s ability to reason meets the ability to create. It delivers a leap in world understanding, multimodality, and editing.


Prompt: Make it look like the weird shape of my hand hole super zooms and magnifies the ground it's looking at in sharper quality.

Prompt: When the finger in <video> touches the animal toy play the sound the animal makes

Prompt: The lights of the apartments start turning on in sync with the music.

Prompt: Transport the violinist to the image environment

Prompt: Make the violin invisible

Prompt: Change the camera angle to be over the violinist’s shoulder.

Prompt: Change spaceship to <object>


Prompt: A marble rolling fast on a chain reaction style track, continuous smooth shot

Prompt: claymation explainer of protein folding, everything is made out of clay, no hands, stop motion, accurate

Prompt: A skeuomorphism stop motion explainer about how the brain hippocampus works with a compelling voiceover. Don’t add seahorses. No voice cuts at the end. Don’t add text.

Prompt: The video shows items of the alphabet. An unusual item starting with each letter is shown sitting on a table (like a Capybara for C, disco globe for D and Lava Lamp for L). All 26 letters must be represented by 26 items with matching lower thirds displaying the letter. Only one item and lower third at a time. Each lower third must look like a black marker written on a slip of paper in the bottom left. Rapid fire, roughly 9 frames per item at 24FPS. Last frame is a slip of paper "THE END". The whole video is accompanied by calm smooth music.

Prompt: word by word, one word on a the screen at a time: did, you, know, that, this, model, can, do, pretty, good, text!? each word appears with a different animated style, perfect pacing to a rhythm, sizzle reel


Creating your prompts

Use our prompt guide to create realistic, coherent, and creative output.

Training/development evaluations including automated and human evaluations carried out continuously throughout and after the model’s training, to monitor its progress and performance

Human red teaming conducted by specialist teams who sit outside of the model development team, across the policies and desiderata, deliberately trying to spot weaknesses and ensure the model adheres to safety policies and desired outcomes

Automated red teaming to dynamically evaluate Gemini Omni Flash for safety and security considerations at scale, complementing human red teaming and static evaluations

Ethics and safety reviews conducted ahead of the model’s release

Content created or edited with Omni in the Gemini app, Google Flow or YouTube includes our imperceptible SynthID digital watermark and C2PA Content Credentials. You can easily verify content through the Gemini app and coming soon to Chrome and Search. You can find out more about how we're expanding our content transparency and verification tools to help you understand how content was created and edited across the web in our blog post.


Gemini

Supercharge your creativity and productivity

Google Flow

An AI creative studio built with and for creatives

YouTube Shorts

A shorter way to discover, watch, and create on YouTube