The latest models are natively multimodal. Audio, video, images, text, are all t...

		johnb231 10 days ago \| parent \| context \| favorite \| on: Claude 4 The latest models are natively multimodal. Audio, video, images, text, are all tokenised and interpreted in the same model.