NotebookLM Audio Overviews are now available in over 50 languages

ljoshua · 2025-04-30T18:06:23 1746036383

NotebookLM audio overviews/podcasts have been an absolute boon for my homeschooled kids. They devour audiobooks and podcasts, and they love learning by listening to these first. Then when we come together for class, we discuss what was covered, and can spend time diving into specifics or doing activities based on the content. It’s super nice to have another option for a learning medium here.

To generate them, we’ve scanned the physical book pages, and then with a simple Python script fed the images into GCP’s Document AI to extract the text en-masse, and concatenated the results together into a text-only version of the chapter. Give that text to NotebookLM and run with it.

SecretDreams · 2025-04-30T18:15:33 1746036933

I've used them. They're very nifty. Google did good here.

One thing I'll note is they only cover the "high level" aspects. No depth. I'd recommend them for someone who is either already very knowledgeable or for someone not at all knowledgeable who is looking for an overview before they plan to do deeper learning/studying through reading.

bbatsell · 2025-04-30T19:24:34 1746041074

> or for someone not at all knowledgeable who is looking for an overview before they plan to do deeper learning/studying through reading

Yep. This is what I have used them (sparingly) for — a scaffold to build the deeper learning onto. My brain struggles to retain information when it doesn’t have a high-level understanding of how/why a system works and how individual parts connect and interact, even if it is all eventually revealed later.

SecretDreams · 2025-04-30T21:49:58 1746049798

Very well said.

rosquillas · 2025-04-30T23:45:45 1746056745

Why not simply upload the pdf version of the scanned book or document? Extracting the text out of a scanned document via GCP Document AI API sounds like unnecessary use of resources

suddenlybananas · 2025-04-30T19:26:12 1746041172

I hope you encourage your kids to actually read as well.

ljoshua · 2025-04-30T20:37:52 1746045472

Oh don’t worry, they make excellent use of their library cards. :)

mikeocool · 2025-04-30T19:40:40 1746042040

NotebookLM podcasts are like a caricature of a real podcast. Every little verbal technique or narrative style that might be used by a normal podcaster in a subtle way is taken to an extreme.

The last one I listened to one host would repeat a keyword or phrase the other host had just said for emphasis — except they did incessantly — with multiple words in every sentence for many sentences in a row.

gervwyk · 2025-04-30T21:36:11 1746048971

Although I 100% agree, there is still a place for it. We place generated conversations with our case studies, and have receive good positive feedback so far, especially from the non-technical crowd. See example https://resonancy.io/case-studies/flava-process-digitization

Of course one can invest more in better authenticity but for what it is, I believe it is a good bang for effort..

Also, if you listen to it for a while, and get over the initial cringe, it becomes enjoyable, at least for me. Some visitors even asked if it was Ai generated. lol

Excited and frightened about the future where its more a real. This was a cool comparison I came across recently [2]

Interestingly I saw today the Descripts Avatars are made to sound and look non-realistic on purpose to avoid I guess all kind of issues, but they claim they want to leave something authentic on the table for real content. Which I think is a good move..

[1] - https://resonancy.io/case-studies/flava-process-digitization [2] - https://yummy-fir-7a4.notion.site/dia

Jolter · 2025-04-30T22:23:09 1746051789

I really enjoyed the “fire!” example. Very naturalistic!

meta_ai_x · 2025-05-01T02:53:33 1746068013

The Google NotebookLM team should take this comment as a badge of honor that they must be doing something right.

HN is the worst place to get product feedback (and I'm sure the NotebookLM team has internal metrics that validates their approach)

sakopov · 2025-04-30T23:03:22 1746054202

Yeah it was incredible in the beginning because it was so novel. Now it's just annoying. Half of the dialogue is repeated and it takes forever to get a point across. Never used NLM, but I wonder if that's something that can be tuned out?

BakeInBeens · 2025-04-30T23:45:41 1746056741

You can always use interactive mode and ask the podcaster for exactly what you want.

Spooky23 · 2025-05-01T00:57:56 1746061076

It sounds like an NPR podcast, which have been self parodied for a long time.

latentsea · 2025-04-30T21:03:06 1746046986

> NotebookLM podcasts are like a caricature of a real podcast. Every little verbal technique or narrative style that might be used by a normal podcaster in a subtle way is taken to an extreme.

So true.

mensetmanusman · 2025-04-30T23:01:06 1746054066

That sounds like a good comedy sketch!

retinaros · 2025-04-30T22:57:43 1746053863

It is slop in ways that even ghibli OAI is not. I never understood why it ever got good press

tkgally · 2025-04-30T20:20:45 1746044445

I tried it with Japanese, and it sounded about as good as in English. Only at one point did it sound unnatural. Japanese two-person conversation uses a lot of backchannelling (aizuchi), that is, semilinguistic sounds made by the listener to indicate attention and emotional reaction. At one point, the female voice said very distinctly "fumu fumu," which is how such aizuchi might be written in a script or manga. In actual speech, though, it would be a continuous sound without syllables and with a rising and/or falling intonation.

That brief TTS-like moment was the only time I was reminded that the voices were not human.

latentsea · 2025-04-30T21:06:08 1746047168

Sometimes you actually say fumu fumu out loud in a conversation for comedic effect.

tkgally · 2025-04-30T21:14:40 1746047680

Yes. In fact, I laughed when I heard the NotebookLM voice say that. It was comically out of place in the context.

okdood64 · 2025-04-30T22:13:04 1746051184

Have an audio link and timestamp to this?

tkgally · 2025-04-30T22:37:30 1746052650

The link is here:

https://notebooklm.google.com/notebook/c36ea335-6686-474d-bf...

The fumu fumu is at 01:50.

The podcast is about the impact of AI on higher education in Japan. I prompted NotebookLM briefly in Japanese about the topic, and it collected ten sources in Japanese and English that it used as the basis for the audio overview.

ipsum2 · 2025-04-30T17:45:18 1746035118

Do people find NotebookLM useful? For my use case of converting papers into podcasts, the explanations are too general (which misses the important parts of the paper) and contain too much fluff.

I suspect that changing the underlying model to Gemini 2.5 Pro would produce better transcripts, but right now there's no way of determining what model is being used.

primax · 2025-05-01T05:45:49 1746078349

I use it for loading up source materials and notes for a DnD campaign I run. Then I ask it questions when I need off the cuff answers, instead of researching.

It's also good for when I can't think of anything (like a background NPCs name and backstory)

alphabetting · 2025-04-30T18:06:49 1746036409

I found this prompt online and tweaking it for audio overviews works extremely well for me.

https://open.substack.com/pub/lawsen/p/notebooklm-podcasts-b...

Generate a deep technical briefing, not a light podcast overview. Focus on technical accuracy, comprehensive analysis, and extended duration, tailored for an expert listener. The listener has a technical background comparable to a research scientist on an AGI safety team at a leading AI lab. Use precise terminology found in the source materials. Aim for significant length and depth. Aspire to the comprehensiveness and duration of podcasts like 80,000 Hours, running for 2 hours or more.

smusamashah · 2025-04-30T22:39:29 1746052769

Where do you put this prompt?

sumedh · 2025-04-30T22:57:53 1746053873

In the Audio Overview, click on Customize and enter the prompt then generate the podcast.

dobladov · 2025-04-30T18:24:04 1746037444

I find NotebookML really useful as a book reading companion, by simply uploading the same book I want to read and asking questions about it, like:

- List the characters in chapter [x] and add a small description about each one. - What's [x] device used for? - What happened in chapter [x]?

It works very well without hallucinations and referencing all the answers.

da_chicken · 2025-04-30T22:21:54 1746051714

I've found it useful for processing the documentation for our data system. The vendor provides the doc in something around 60 PDF files, and a lot of the information is poorly organized within the PDFs.

I can say, "Hey, NotebookLM, explain the difference between feature X and feature Y to me," or, "How do I configure Z to work the way we want?" And while the answers still kinda suck because the documentation is pretty shitty, it's way faster than digging through the PDFs. And it cites the PDFs so I can (with some trouble) find the actual documentation in the PDF if I need it.

The worst part of it is that it only accepts 50 PDFs at once.

Honestly, though, the best use for it I've seen was when my GM added the PDF rulebooks to our TTRPG to NotebookLM. We were then able to ask NotebookLM rules questions, and it would answer us pretty well. That's what it's really great for.

I don't care about the audio features at all. The first thing I do is close the audio pane.

harryf · 2025-04-30T17:51:36 1746035496

It’s useful for getting summaries of long YouTube videos - I’m found it semi helpful for improving my Davinci Resolve skills.

That said Google is screwing the pooch as usual by trying to make it another walled garden. Slap an API on NoteboolLM already! The market research has already been done - there’s even an unofficial API https://www.reddit.com/r/notebooklm/comments/1eti9iz/api_for...

energy123 · 2025-04-30T18:00:10 1746036010

For YouTube videos it's hard to beat (1) copy transcript to clipboard (from eg tactiq) (2) paste into LLM chat and ask for summary

Rebelgecko · 2025-04-30T18:28:05 1746037685

Full disclosure, I work for Google opinions are my own etc etc

The LLM built into YouTube is one of the few LLM chatbots bolted onto existing apps that I actually find useful. Not just for summaries but questions like "what is the timestamp in this 2 hour video where they talk about _____".

jjwiseman · 2025-04-30T22:05:53 1746050753

"LLM built into YouTube…" The what now? This is the first I've heard of this.

Rebelgecko · 2025-05-01T03:01:58 1746068518

I thought it was for everyone my bad. Turns out except for some educational videos it's just for premium subscribers with certain location/language combos (you can probably guess which...)

https://support.google.com/youtube/answer/14110396?hl=en

mvdtnz · 2025-05-01T00:56:45 1746061005

I suspect he doesn't know he's talking about some internal tool that Google hasn't released to the public.

hu3 · 2025-04-30T19:15:23 1746040523

> "what is the timestamp in this 2 hour video where they talk about _____"

wow I gave up searching specific timestampos of long videos before. Never again.

Thank you!

trees101 · 2025-05-01T02:05:37 1746065137

how do we access this?

Rebelgecko · 2025-05-01T03:02:39 1746068559

https://support.google.com/youtube/answer/14110396

If you're not already a premium subscriber you may want to stick with other tools. I didn't mean to unintentionally advertise YouTube Premium:)

dieortin · 2025-04-30T22:14:24 1746051264

Or just paste the video URL onto Gemini and ask for summary, no need to search for any transcript

skeptrune · 2025-04-30T18:16:28 1746036988

It's hard for every AI product to beat that workflow lol. It works well for basically everything.

Spooky23 · 2025-05-01T01:03:00 1746061380

I used it for a a bootcamp class to study for an exam. I recorded about 50 hours of lecture and Q&A, and was able to generate good Anki cards from it. What was awesome was that I could ask “make a list of all of the topics the instructor thought would be questions on the exam” and it did a great job at that.

The podcast thing is more a novelty to me.

HanClinto · 2025-04-30T20:47:01 1746046021

I've found it very useful for providing accessible introductions to technical papers that are otherwise difficult for me to get started with understanding.

If I encounter a paper that is too difficult for me to digest just by reading, then I take a step back, feed it into NotebookLM, and listen to that summary. I've only done this a few times, but so far it hasn't failed to give me the overview and momentum that I need to take another stab and successfully dig into the paper and digest it on my own.

As others have noted, it can gloss over certain details and miss important points from time to time, but overall it does a fantastic job of giving me an introduction to a complex topic and making it far less indimidating / overwhelming.

jsnell · 2025-04-30T18:00:57 1746036057

You can enter a prompt from the "customize" dialog. Have you tried asking for a more specifics, assume the audience is an expert on the subject, and cut down on the fluff?

jszymborski · 2025-04-30T17:51:25 1746035485

I've run them on my own papers and, while sometimes they are accurate, they are sometimes very very wrong and misrepresent things. And I don't mean in nuanced or unimportant ways.

The TTS is amazing, but the audio overviews are frankly useless for me.

jcims · 2025-04-30T18:00:19 1746036019

I haven't really found it interesting for technical content but do think it's somewhat useful for hashing out more subjective and/or personal things like goals, difficulties, conflict, etc.

pottertheotter · 2025-04-30T17:56:55 1746035815

What’s interesting is that the create podcast thing is just a feature of NotebookLM. But everyone thinks that’s what NotebookLM is

bongodongobob · 2025-04-30T18:06:11 1746036371

It seems to be the only unique feature though. Any LLM can summarize things for me or make bullet points.

twoWhlsGud · 2025-04-30T18:15:34 1746036934

If you have a corpus of documents you are working with (say thousands of pages of related standards docs), Notebook can be handy for doing targeted summaries of aspects of the docs with pointers back into the actual docs to the relevant source material. That's something I end up needing a lot (I've never used the podcast feature) and so it feels very differentiated to me...

alphabetting · 2025-04-30T18:10:56 1746036656

The one other unique thing I use from them is the interactive mind maps. Like a table of contents on steroids

bradly · 2025-04-30T19:25:59 1746041159

At Shopify I working as an engineer in financial services and certain changes required approval by our banking partners. I was able to upload our credit policies to NotebookLM and easily ask questions without having to ping our the legal team in Slack. I'm about as bearish as they come as far as AI tools go and NotebookLM was one of the few tools that felt useful to me straight away.

chupchap · 2025-05-01T00:52:32 1746060752

I used NotebookLM for holiday planning. I put in a dozen links with touristy things to do at the destinations and 5 odd Youtube videos. I then asked it to craft an itinerary as a travel agent who is planning holiday for a couple without kids. Included the type of things I would like to do and not do as well. The result was pretty good. The podcast generated was fun as well

TekMol · 2025-04-30T17:51:41 1746035501

I find the podcast style audio it produces super annoying.

Is there an easy way to simply have text read to me unaltered?

sega_sai · 2025-04-30T18:20:58 1746037258

Absolutely the same complaint. I wanted to see if it could summarize papers well, but I just could not handle all the conversation and attempts to make it 'exciting'. Especially in areas where I already know the background.

threeducks · 2025-04-30T18:58:31 1746039511

Over seven years ago, this has been foretold exactly by the show Silicon Valley:

https://www.youtube.com/watch?v=K3pYZwol6Dc&t=73s

Transcript of the fridge scene:

    Fridge (after a bar code was scanned): "Ah, there we go."
    Gilfoyle: "It's bad enough that it has to talk. Does it need fake vocal ticks like 'uh'." 
    Dinesh: "Well it just makes it sound more human."
    Gilfoyle: "Humans are shit. This thing is addressing problems that don't exist. It's solutionism at its worst. We are dumbing down machines that are inherently superior."

I would like to have a Gilfyole mode for NotebookLM where the machine answers only with cold precision instead of endless "Mmmhmm", "Yeah!", "Amazing!", "That's so cool!".

shreezus · 2025-04-30T19:23:26 1746041006

It's one of those things that's impressive initially, but after generating a couple it feels quite formulaic.

crawsome · 2025-04-30T18:06:25 1746036385

I really is an obnoxious level of over-enthusiasm.

kleiba · 2025-04-30T19:17:13 1746040633

Give German a try - trust me, you don't have to speak the language but anyone can tell that it's quite different in tone. No valley girls in Deutschland!

lenwood · 2025-04-30T19:24:25 1746041065

I like the NotebookLM podcasting feature, have used it a few times to come up to speed. There's one quirk of the dialogue that I find annoying though, the two speakers finish one another's sentences. At first I thought that was a nice touch, but it happens often enough that it became distracting. I should experiment with the prompt to limit how often it happens.

hu3 · 2025-04-30T18:43:37 1746038617

I like to feed Hacker News comments to generate a podcast.

It's good to get the big picture about the discussion with 300+ comments.

Almondsetat · 2025-04-30T20:30:31 1746045031

I really don't understand why they went with this podcast style. Sure, it makes an impression the first few times, great for a showcase or an announcement. The problem though is that it soon becomes pretty annoying, especially because the hosts go back and forth between knowing nothing and knowing everything about the topic. They should at least choose randomly which one does the explaining to whom.

razster · 2025-04-30T20:36:08 1746045368

Absolutely agree with you, we ran into the same issue. Our company actually tried using it for our software documentation and user onboarding, hoping it would be a helpful and engaging format. But the podcast-style delivery just didn’t fit our needs. It’s fine for a quick showcase or intro, but for ongoing support or business-oriented material, the format became distracting. If only they offered alternative styles—something more structured and professional—we might have stuck with it.

moribunda · 2025-04-30T20:52:45 1746046365

You should check new features - like asking questions as a listener.

I don't use it a lot, but it's useful when you want to have an engaging audio interface to long (50p+) reports, which you wouldn't normally read because it's not your area of expertise or you don't have time, but you can listen while doing some cardio or chores.

ccbikai · 2025-05-01T02:31:57 1746066717

His Chinese voice effect is not as good as Minimax.

You can use Hacker Podcadt to compare

https://hacker-podcast.agi.li/

ahmedfromtunis · 2025-04-30T19:07:54 1746040074

The best feature is by far the ability to interact with the "hosts" to ask for clarifications or to guide them into focusing on a particular aspect; even for things that weren't covered in the source material.

davidg707 · 2025-05-01T02:28:40 1746066520

I created a NotebookLM podcast based on a blog post I wrote and played it for my parents. They got very excited thinking that I 'made it' because other people were talking about my work. Then I told them what it really was and they were a little bit disappointed and a little bit amazed.

tinyhouse · 2025-04-30T18:32:02 1746037922

They don't have an app? strange.

anyfactor · 2025-04-30T17:37:11 1746034631

https://support.google.com/notebooklm/answer/15731776

  - Afrikaans
  - Albanian
  - Arabic
  - Armenian
  - Azerbaijani
  - Basque
  - Bengali
  - Bulgarian
  - Burmese (Myanmar)
  - Catalan
  - Cebuano
  - Chinese (Simplified)
  - Chinese (Traditional)
  - Croatian
  - Czech
  - Danish
  - Dutch
  - English
  - Estonian
  - Filipino
  - Finnish
  - French (Canada)
  - French (European)
  - Galician
  - Georgian
  - German
  - Greek
  - Gujarati
  - Haitian Creole
  - Hebrew
  - Hindi
  - Hungarian
  - Icelandic
  - Indonesian
  - Italian
  - Japanese
  - Javanese
  - Kannada
  - Konkani
  - Korean
  - Latin
  - Latvian
  - Lithuanian
  - Macedonian
  - Maithili
  - Malay
  - Malayalam
  - Marathi
  - Nepali
  - Norwegian (Bokmål)
  - Norwegian (Nynorsk)
  - Oriya
  - Pashto
  - Persian
  - Polish
  - Portuguese (Brazil)
  - Portuguese (Portugal)
  - Punjabi
  - Romanian
  - Russian
  - Serbian (Cyrillic)
  - Sindhi
  - Sinhala
  - Slovak
  - Slovenian
  - Spanish (European)
  - Spanish (Latin America)
  - Spanish (Mexico)
  - Swahili
  - Swedish
  - Tamil
  - Telugu
  - Thai
  - Turkish
  - Ukrainian
  - Urdu
  - Vietnamese

behnamoh · 2025-04-30T19:52:59 1746042779

> Persian

I'm glad the name of my native language is written correctly. In many cases, people say "Farsi", which is offensive to many Iranians because it's the Arabic version of the word "Parsi" (unlike Persian, Arabic doesn't have "p", "g", "ch", "zh").

It's like someone calling English "Anglaise" because that's how the French say it.

PS: Contrary to common belief, Persian and Arabic are totally different languages, though they have borrowed words from one another (think English and French). Persian is an Indo-European language whereas Arabic is Aramaic (same roots as Hebrew).

crazygringo · 2025-04-30T20:29:14 1746044954

> It's like someone calling English "Anglaise" because that's how the French say it.

That is the case for some other languages, though. We call the language German rather than Deutsch because Germani was the Latin name for tribes in the area, for example.

Or native names get modified too -- in English we don't call it Espanish, just Spanish, even though it comes from español.

The names of languages in other languages tend to get modified in tons of different and random ways for lots of reasons. Is there really a reason to take offense at it?

It doesn't bother me that Italians call me an americano instead of an American. It's just a letter change. So why is it so bothersome that it's called Farsi rather than Parsi? Can't the change from "p" to "f" be seen as an interesting historical quirk, due to the fascinating effect of Arabic on European languages in the Middle Ages? At the same time that we got Arabic words like "algebra" and "alcohol"?

FlyingSnake · 2025-04-30T20:14:48 1746044088

Interesting. This is the first time I’m hearing that Farsi is offensive to Iranians. None of my Irani friends have objected so I’m curious if I’m missing something.

Wikipedia says Farsi should be avoided in Western languages, but what about others? Persian is called Farsi in Indian subcontinent due to the deep historical connections we share. We have proverbs saying Farsi is the sign of a learned person etc.

omneity · 2025-04-30T22:52:16 1746053536

Arabic is not Aramaic. Please correct your sources.

I’m also quite curious about the sounds of “ch” and “zh” which exist in Arabic as ش and ج, or did you mean something else?

behnamoh · 2025-04-30T23:10:11 1746054611

"ch" is written as "چ" in Persian (sounds like channel).

"zh" is written as "ژ" in Persian (sounds like bourgeoisie in French).

omneity · 2025-04-30T23:35:51 1746056151

Looking at the Persian IPA table[0] for the letters you wrote, we get `/ʒ/` for `ژ` and `/tʃʰ/` for `چ`

In Arabic[1], there are two close phonemes: `/dʒ/` for `ج` and `/ʃ/` for `ش`

The difference in both phonemes is minimal and are practically affricates[2] of each other (where `d` or `t` can precede a `ʒ` or a `ʃ`), so it seems these sounds are present in both Arabic and Persian.

These variations are also within the dialectal distribution of either languages. For example `ج` is pronounced `/dʒ/` in Algeria and `/ʒ/` in Morocco.

0: https://en.wikipedia.org/wiki/Help:IPA/Persian

1: https://en.wikipedia.org/wiki/Help:IPA/Arabic

2: https://en.wikipedia.org/wiki/Affricate

myth_drannon · 2025-04-30T20:09:45 1746043785

Small nitpicking, Arabic is from a different branch of Semitic languages than Aramaic or Hebrew (which are very similar).

And TIL I learned that Aramaic replaced Hebrew in Judea because the Persian Empire maintained Aramaic as the official administrative language, and Jews brought it back, coming back from the Babylonian captivity.

riffic · 2025-04-30T17:50:56 1746035456

cool, thanks for the wall of text.

jszymborski · 2025-04-30T17:51:57 1746035517

There's a collapse feature (the [-] link at the top of the post)

crazygringo · 2025-04-30T20:10:49 1746043849

It's still far too much for a HN comment.

You have to scroll down a couple pages' worth before you even realize this might be SO long you need to collapse it. So then you've got to scroll back UP a couple pages, find the teensy [-] link...

It's enough to just post the link to the list of languages. The list itself doesn't belong in a comment here, when it's that long.

riffic · 2025-04-30T21:50:04 1746049804

not to mention most of us know how to click to a source to view the authoritative list. why repeat it here?