
SEARCHENGINES

How BERT Helps With Google Search


Google has posted a new video on how BERT helps Google Search understand language. Google has been using BERT in Search since 2018, although we only learned about it in 2019. The short video basically says BERT is about Google understanding the little words better.

Here is the video:

Here is the transcript if you don’t want to listen:

If a pancake recipe told you to “mix the batter with the banana,” you probably wouldn’t think to use the banana as a mixing spoon. But what’s obvious to humans, things like context, tone, and intention, are actually very difficult for computers to pick up on.

At its core, a Google Search is about understanding language. In order to return the right information, Google doesn’t just need to know the definition of the words… it needs to know what they all mean when strung together in a specific order. And that includes the smaller words like “for” and “to.” And when you think about how many different meanings a single word can have… you start to see how writing a computer program that takes all these nuances into account is pretty tough. See? Case in point. “Pretty” here doesn’t mean beautiful, it means “very.”

More and more, people talk to Google the way they think and speak. And, more and more, Google is getting better at understanding what they mean. One of the biggest leaps forward in the history of Search came about with the introduction of “Bidirectional Encoder Representations from Transformers” or as we like to call it, BERT. BERT is a machine-learning model architecture that helps Google process language and understand the context in which it appears.

Search used to process a query by pulling out the words it thought were most important. For example, if you said, “can you get medicine for someone pharmacy” you would have gotten general results about pharmacies and prescriptions because it would have essentially ignored the word “for.” But with BERT, the LITTLE words are taken into account and it changes things. Search now understands you want to know if you can pick up medicine… prescribed to someone else.

But how do you train a language model to pick up context? There’s a big difference between knowing words and understanding meaning. The model learns context by applying the same fill-in-the-blank principles it takes to complete a Mad Libs. So we take a phrase. We hide about 20% of the input words. And then we make the computer guess the words that are missing. Over time, the model begins to understand different words have different meanings depending on what’s around them. And the order in which they appear in that text really matters.

So when you search something complex like, “Fly fishing bait to use for trout in september montana” Search knows all the little words are important and because it now takes them all into account, Google can tell you the perfect bait for that time of year. BERT isn’t foolproof, but since implementing it in 2019, it’s improved a lot of searches. We should always be able to learn about whatever we’re curious about. And that’s why Search will always be working to understand exactly what you’re truly asking.
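To make the fill-in-the-blank idea concrete, here is a minimal Python sketch of the hiding step only. It is not Google's code: the function name `mask_tokens`, the `[MASK]` placeholder, the whitespace tokenization, and the fixed random seed are all assumptions made for illustration. The 20% rate is the figure quoted in the video. The guessing step, where a model is trained to predict the hidden words, is not shown.

```python
import random

MASK = "[MASK]"  # hypothetical placeholder token, chosen for this sketch


def mask_tokens(tokens, mask_rate=0.2, seed=0):
    """Hide roughly `mask_rate` of the tokens.

    Returns the masked token list and a dict mapping each hidden
    position to the original word (the "answers" a model would guess).
    """
    rng = random.Random(seed)
    # Hide at least one word, but never more words than the phrase has.
    n_hidden = min(len(tokens), max(1, round(len(tokens) * mask_rate)))
    hidden_positions = set(rng.sample(range(len(tokens)), n_hidden))
    masked = [MASK if i in hidden_positions else tok
              for i, tok in enumerate(tokens)]
    answers = {i: tokens[i] for i in sorted(hidden_positions)}
    return masked, answers


# The example query from the transcript, split on whitespace (an assumption).
phrase = "can you get medicine for someone pharmacy".split()
masked, answers = mask_tokens(phrase)
print(" ".join(masked))
print(answers)
```

With a seven-word phrase and a 20% rate, one word is hidden. A training loop would then score the model on how often it recovers the words stored in `answers` from the surrounding context.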

I am surprised Google did not release this video when we wrote about how Google uses AI in search.

Forum discussion at Twitter.





Google Search Not Indexing & Serving New Content?


Google Search may be having a new indexing or serving (or both) bug where it is not showing new content from sites that are creating new content. Sites like the Wall Street Journal, NY Times, CNN, Forbes, and others are showing very few new pages indexed by Google Search in the past hour.

While all of these sites have produced dozens of stories in the past hour, most are showing 0 to 10 new URLs in the Google Search index for that period.

Google has not posted any issue yet in its Search Status Dashboard, and I do not see massive complaints yet. I received a notice on X about this (that post was since deleted), and I still see the issue.

Here are some examples:

site:cnn.com – within the past hour – 2 results

site:wsj.com – within the past hour – 2 results

WSJ down to nothing.

site:nytimes.com – within the past hour – 10 results

site:hollywoodreporter.com – within the past hour – 2 results

site:forbes.com – within the past hour – 0 results
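The checks above combine a site: query with Google's past-hour time filter. As a rough sketch of how to repeat them for a list of domains, the Python snippet below only builds the search URLs; it does not fetch or count results. The function name `past_hour_site_query` is made up for this example, and the `tbs=qdr:h` parameter is an assumption: it is the value commonly seen in the URL when you pick "Past hour" under Tools, but it is not officially documented and could change.

```python
from urllib.parse import urlencode


def past_hour_site_query(domain):
    """Build a Google Search URL for `site:<domain>` limited to the past hour.

    Assumption: `tbs=qdr:h` is the commonly observed (undocumented)
    URL parameter behind the "Past hour" option in the Tools menu.
    """
    params = {"q": f"site:{domain}", "tbs": "qdr:h"}
    return "https://www.google.com/search?" + urlencode(params)


# Domains taken from the examples in this story.
for domain in ["cnn.com", "wsj.com", "nytimes.com", "forbes.com"]:
    print(past_hour_site_query(domain))
```

Opening each URL in a browser shows the same kind of result count listed above.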

Here are more examples of massive news sites not seeing much content in the Google index in the past hour: Fox News, NBC News, AP News, Reuters, and CNBC.

I often see indexing complaints about Google Search but I rarely see it on sites this big that produce a lot of content.

We last had a larger issue like this in January 2024, and before that in December 2023 and October 2023, with many more over the years.

Are any of you seeing this?

Update at 7:30pm ET: I waited over 30 minutes for this story to be indexed by Google and it was not. Normally this site is indexed within minutes of a story going live.


So I checked Google Search Console and it said the page was crawled on Jun 20, 2024, 7:14:49 PM but the page was not in the Google index.

Meanwhile, the live URL inspection tool says there is nothing preventing it from being indexed.

I did request indexing manually to be sure – but still, not indexed.

Forum discussion at X.


Daily Search Forum Recap: June 19, 2024


Here is a recap of what happened in the search forums today, through the eyes of the Search Engine Roundtable and other search forums on the web.


Google posted its FAQs on AI Overviews, including why you can’t disable them. Google may be showing fewer Reddit links. Google explains when it is a bad thing that Googlebot is crawling you more. Google is testing six People Also Ask results by default. Google launched menu buttons in the Google Maps results. Google AdSense has a new privacy and messaging feature for privacy rules.


Search Engine Roundtable Stories:


  • Google AI Overviews FAQs Including Why You Can’t Disable AI Overviews


    Google has posted a new document in its forums named frequently asked questions about AI Overviews. The document has a section named “why can’t I disable AI Overviews?” The answer is that Google’s “goal is to help people find the information they’re looking for quickly and reliably.”

  • Report On If Google Showing Fewer Reddit Links In Search?


    Some in the SEO industry have been asking if Google has been showing fewer links to Reddit in its search results over the past several weeks. Mordy Oberstein pulled some Semrush data that showed a slight downtick in Reddit results in Google’s Discussion and Forums section.

  • Google: Two Common Reasons When A Spike In Crawling Is Bad


    Google’s Gary Illyes posted on LinkedIn with two common examples of when a spike in Googlebot crawling activity is a bad thing. The short answer is when Googlebot starts crawling an infinite section of your site (like calendar pages that go on forever) and when your site is hacked with a ton of new hacked pages.

  • Google Tests Two More People Also Ask Results (6 PAA)


    Google is testing showing two more People Also Ask results by default, for a total of six instead of the four that Google typically shows in its search results.



  • Google Local Panels Gains Menu Button


    Google seems to be adding a “menu” button to the Google Business Profiles, the local panels, in the web search results. I think Google has tested this on and off over the years, but as Marcin Karwowski noted, it seems to be rolling out now to some businesses.



  • New Google AdSense Privacy & Messaging For Users To Opt Out


    Google has begun rolling out a new privacy and messaging feature for AdSense ads in some US states. This is to comply with California, Colorado, Connecticut, Virginia, and Utah privacy laws. The feature allows the site to communicate to the user about opting out of the sale or sharing of their personal information.



  • Google Japanese Circular Keyboard


    Here is a photo from Daniel Waisberg’s trip to the Google Japan office, and it shows these circular Japanese keyboards on display. There are other items on display.


Feedback:


Have feedback on this daily recap? Let me know on Twitter @rustybrick or @seroundtable, or on Threads, Mastodon, and Bluesky. You can also follow us on Facebook and Google News, and make sure to subscribe to the YouTube channel, Apple Podcasts, Spotify, or Google Podcasts, or just contact us the old-fashioned way.




Daily Search Forum Recap: June 18, 2024


Here is a recap of what happened in the search forums today, through the eyes of the Search Engine Roundtable and other search forums on the web.


The # signs in Google Search Console reports are not related to canonicalization in Google Search; they are related to Google Sitelinks. Bing Webmaster Tools can show clickthrough rates of over 100%, no joke. Google is testing black sitelinks. Google Maps added an edit and post button to the main buttons for Google Business Profiles. And no, Google Search does not always show the original source in its search results.


Search Engine Roundtable Stories:


  • Google: #s In Google Search Console Reports Are Unrelated To Canonicalization


    There is a lot of talk in the wider SEO community that pound signs, #, in the Google Search Console reports mean something about canonicalization. John Mueller from Google, along with a number of SEOs, said it has nothing to do with canonicalization. Instead, it has to do with Google tracking on-page sitelinks from the Google Search results.

  • Google: Our Search Results Do Not Always Show Original Source


    Google’s John Mueller said in the last SEO Google Office Hours video that Google’s “search results are not an indication of what Google’s systems consider to be the original source.” This means that just because Google ranks a piece of content, it does not necessarily mean that the content is the original source.



  • Bing Webmaster Tools Clickthrough Rate Over 100%


    Did you ever sort the Bing Webmaster Tools clickthrough-rate column from highest to lowest and notice a CTR higher than 100%? I even see 300%. Fabrice Canel from Microsoft said this is possible when a searcher clicks on your listing more than once from the same search result set.


  • Google Maps Adds Edit & Post Button To Business Profile Listing


    Google has added an “edit” and “post” button more prominently as big buttons to Google Business Profiles within the Google Maps interface. Previously, these buttons were not as accessible in the Google Maps interface.

  • Google Search Tests Black Sitelinks


    Google is testing another variation of its sitelinks design; this one uses a black font color for the sitelinks. Of course, Google generally uses blue for those links, because links are generally in a blue, underlined font.



  • Google Japan Zashiki Seating


    Daniel Waisberg was at the Google Japan office, where he snapped some photos and shared them on LinkedIn. Here is one of the cafe areas that has Japanese floor seating, also called Zashiki/Tatami seating.
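On the Bing Webmaster Tools item above, the arithmetic behind a clickthrough rate over 100% is easy to check. The Python sketch below assumes CTR is simply clicks divided by impressions, with each repeat click from the same result set counted as a separate click against a single impression. The story does not spell out Bing's exact counting rules, so treat this as an illustration and not a description of Bing's implementation. The function name `clickthrough_rate` is made up for this example.

```python
def clickthrough_rate(clicks, impressions):
    """Return CTR as a percentage, assuming CTR = clicks / impressions."""
    if impressions <= 0:
        raise ValueError("impressions must be positive")
    return 100.0 * clicks / impressions


# One searcher sees the listing once (1 impression) and clicks it three times.
print(clickthrough_rate(3, 1))

# A more typical case: 1 click across 4 impressions.
print(clickthrough_rate(1, 4))
```

Under that assumption, three clicks on a single impression works out to 300%, which matches the kind of number described in the item.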
