{"id":986,"date":"2023-07-06T14:59:07","date_gmt":"2023-07-06T14:59:07","guid":{"rendered":"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/?p=986"},"modified":"2024-01-16T13:48:56","modified_gmt":"2024-01-16T13:48:56","slug":"google-releases-google-soundstorm-paper","status":"publish","type":"post","link":"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/google-releases-google-soundstorm-paper\/","title":{"rendered":"Google Releases Google Soundstorm Paper"},"content":{"rendered":"\n<h1 class=\"wp-block-heading\">Google Soundstorm<\/h1>\n\n\n\n<p>Google called it Insane Audio Generation, that&#8217;s Google Soundstorm. <\/p>\n\n\n\n<p>SoundStorm is a machine learning model that generates audio files. It is non-autoregressive.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><a href=\"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-content\/uploads\/2024\/01\/Google-Soundstorm.jpg\"><img loading=\"lazy\" decoding=\"async\" width=\"721\" height=\"292\" src=\"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-content\/uploads\/2024\/01\/Google-Soundstorm.jpg\" alt=\"Google Soundstorm\" class=\"wp-image-1207\" srcset=\"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-content\/uploads\/2024\/01\/Google-Soundstorm.jpg 721w, https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-content\/uploads\/2024\/01\/Google-Soundstorm-600x243.jpg 600w, https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-content\/uploads\/2024\/01\/Google-Soundstorm-300x121.jpg 300w\" sizes=\"auto, (max-width: 721px) 100vw, 721px\" \/><\/a><figcaption class=\"wp-element-caption\">Google Soundstorm<\/figcaption><\/figure>\n<\/div>\n\n\n<p>\u201cNon-autoregressive approaches aim to improve the inference speed of translation models by only requiring a single forward pass to generate the output sequence instead of iteratively producing each predicted token.\u201d (Apple Machine Learning)<\/p>\n\n\n\n<p>Requiring only a single forward pass as opposed to multiple iterations makes it really fast.<\/p>\n\n\n\n<p>Blazingly fast! <\/p>\n\n\n\n<p>In fact, Google Research highlights that \u201cWhen synthesizing dialogue segments of 30 seconds, we measured a runtime of 2 seconds on a single TPU-v4\u201d. (source)<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Example Prompt<\/h2>\n\n\n\n<p>For example, Google researchers gave it the following dialogue prompt:<\/p>\n\n\n\n<pre class=\"wp-block-preformatted\"><em><code>Where did you go last summer? | I went to Greece, it was amazing. | Oh, that's great. I've always wanted to go to Greece. What was your favorite part? | Uh it's hard to choose just one favorite part, but yeah I really loved the food. The seafood was especially delicious. | yeah | And the beaches were incredible. | uhhuh | We spent a lot of time swimming, uh sunbathing, and and exploring the islands. | Oh that sounds like a perfect vacation! I'm so jealous. | It was definitely a trip I'll never forget | I really hope I'll get to visit someday!<\/code><\/em><\/pre>\n\n\n\n<p>The impressive output generated by the model (<a href=\"https:\/\/google-research.github.io\/seanet\/soundstorm\/examples\/\" target=\"_blank\" rel=\"noreferrer noopener\">source<\/a>):<audio src=\"https:\/\/google-research.github.io\/seanet\/soundstorm\/examples\/data\/dialogue\/rb_travel_1.mp3\"><\/audio><\/p>\n\n\n\n<p>Now think about this for a moment. You could create a simple pipeline like this:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li><strong>Step 1:&nbsp;<\/strong>Generate dialogues with&nbsp;<a rel=\"noreferrer noopener\" href=\"https:\/\/blog.finxter.com\/11-best-chatgpt-alternatives\/\" target=\"_blank\">ChatGPT<\/a>&nbsp;or&nbsp;<a href=\"https:\/\/blog.finxter.com\/openapi-cheat-sheet\/\" target=\"_blank\" rel=\"noreferrer noopener\">OpenAI API<\/a><\/li>\n\n\n\n<li><strong>Step 2:&nbsp;<\/strong>Feed the dialogues into the SoundStorm model<\/li>\n\n\n\n<li><strong>Step 3:&nbsp;<\/strong>Upload to a podcasting platform<\/li>\n\n\n\n<li>Repeat!<\/li>\n<\/ol>\n\n\n\n<p>And 99% of people wouldn\u2019t even note a difference!<\/p>\n\n\n\n<p>But there are many more applications, such as replacing human readers of audiobooks (yet another job description that will be disrupted soon!), creating truly accessible web apps with human readers, and rapid prototyping for movies and (YouTube) videos.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Google Soundstorm Google called it Insane Audio Generation, that&#8217;s Google Soundstorm. SoundStorm is a machine learning model that generates audio files. It is non-autoregressive. \u201cNon-autoregressive approaches aim to improve the inference speed of translation models by only requiring a single forward pass to generate the output sequence instead of iteratively producing each predicted token.\u201d (Apple Machine Learning) Requiring only a single forward pass as opposed to multiple iterations makes it really fast. Blazingly fast! In fact, Google&hellip;<\/p>\n","protected":false},"author":1,"featured_media":1207,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"om_disable_all_campaigns":false,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[161],"tags":[],"class_list":["post-986","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-breaking-ai-news"],"aioseo_notices":[],"jetpack_featured_media_url":"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-content\/uploads\/2024\/01\/Google-Soundstorm.jpg","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-json\/wp\/v2\/posts\/986","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-json\/wp\/v2\/comments?post=986"}],"version-history":[{"count":2,"href":"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-json\/wp\/v2\/posts\/986\/revisions"}],"predecessor-version":[{"id":1208,"href":"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-json\/wp\/v2\/posts\/986\/revisions\/1208"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-json\/wp\/v2\/media\/1207"}],"wp:attachment":[{"href":"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-json\/wp\/v2\/media?parent=986"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-json\/wp\/v2\/categories?post=986"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.onemoremoney.com\/makemoneyonlinewithai\/wp-json\/wp\/v2\/tags?post=986"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}