Connect with us

SEO

5 Questions Answered About The OpenAI Search Engine

Published

on

5 Questions Answered About The OpenAI Search Engine

It was reported that OpenAI is working on a search engine that would directly challenge Google. But details missing from the report raise questions about whether OpenAI is creating a standalone search engine or if there’s another reason for the announcement.

OpenAI Web Search Report

The report published on The Information relates that OpenAI is developing a Web Search product that will directly compete with Google. A key detail of the report is that it will be partly powered by Bing, Microsoft’s search engine. Apart from that there are no other details, including whether it will be a standalone search engine or be integrated within ChatGPT.

All reports note that it will be a direct challenge to Google so let’s start there.

1. Is OpenAI Mounting A Challenge To Google?

OpenAI is said to be using Bing search as part of the rumored search engine, a combination of a GPT-4 with Bing Search, plus something in the middle to coordinate between the two .

In that scenario, what OpenAI is not doing is developing its own search indexing technology, it’s using Bing.

Advertisement

What’s left then for OpenAI to do in order to create a search engine is to devise how the search interface interacts with GPT-4 and Bing.

And that’s a problem that Bing has already solved by using what it Microsoft calls an orchestration layer. Bing Chat uses retrieval-augmented generation (RAG) to improve answers by adding web search data to use as context for the answers that GPT-4 creates. For more information on how orchestration and RAG works watch the keynote at Microsoft Build 2023 event by Kevin Scott, Chief Technology Officer at Microsoft, at the 31:45 minute mark here).

If OpenAI is creating a challenge to Google Search, what exactly is left for OpenAI to do that Microsoft isn’t already doing with Bing Chat? Bing is an experienced and mature search technology, an expertise that OpenAI does not have.

Is OpenAI challenging Google? A more plausible answer is that Bing is challenging Google through OpenAI as a proxy.

2. Does OpenAI Have The Momentum To Challenge Google?

ChatGPT is the fastest growing app of all time, currently with about 180 million users, achieving in two months what took years for Facebook and Twitter.

Yet despite that head start Google’s lead is a steep hill for OpenAI to climb.  Consider that Google has approximately 3 to 4 billion users worldwide, absolutely dwarfing OpenAI’s 180 million.

Advertisement

Assuming that all 180 million OpenAI users performed an average of 4 searches per day, the daily number of searches could reach 720 million searches per day.

Statista estimates that there are 6.3 million searches on Google per minute which equals over 9 billion searches per day.

If OpenAI is to compete they’re going to have to offer a useful product with a compelling reason to use it. For example, Google and Apple have a captive audience on mobile device ecosystem that embeds them into the daily lives of their users, both at work and at home. It’s fairly apparent that it’s not enough to create a search engine to compete.

Realistically, how can OpenAI achieve that level of ubiquity and usefulness?

OpenAI is facing an uphill battle against not just Google but Microsoft and Apple, too. If we count Internet of Things apps and appliances then add Amazon to that list of competitors that already have a presence in billions of users daily lives.

OpenAI does not have the momentum to launch a search engine to compete against Google because it doesn’t have the ecosystem to support integration into users lives.

Advertisement

3. OpenAI Lacks Information Retrieval Expertise

Search is formally referred to as Information Retrieval (IR) in research papers and patents. No amount of searching in the Arxiv.org repository of research papers will surface papers authored by OpenAI researchers related to information retrieval. The same can be said for searching for information retrieval (IR) related patents. OpenAI’s list of research papers also lacks IR related studies.

It’s not that OpenAI is being secretive. OpenAI has a long history of publishing research papers about the technologies they’re developing. The research into IR does not exist. So if OpenAI is indeed planning on launching a challenge to Google, where is the smoke from that fire?

It’s a fair guess that search is not something OpenAI is developing right now. There are no signs that it is even flirting with building a search engine, there’s nothing there.

4. Is The OpenAI Search Engine A Microsoft Project?

There is substantial evidence that Microsoft is furiously researching how to use LLMs as a part of a search engine.

All of the following research papers are classified as belonging to the fields of Information Retrieval (aka search), Artificial Intelligence, and Natural Language Computing.

Here are few research papers just from 2024:

Advertisement

Enhancing human annotation: Leveraging large language models and efficient batch processing
This is about using AI for classifying search queries.

Structured Entity Extraction Using Large Language Models
This research paper discovers a way to extracting structured information from unstructured text (like webpages). It’s like turning a webpage (unstructured data) into a machine understandable format (structured data).

Improving Text Embeddings with Large Language Models (PDF version here)
This research paper discusses a way to get high-quality text embeddings that can be used for information retrieval (IR). Text embeddings is a reference to creating a representation of text in a way that can be used by algorithms to understand the semantic meanings and relationships between the words.

The above research paper explains the use:

“Text embeddings are vector representations of natural language that encode its semantic information. They are widely used in various natural language processing (NLP) tasks, such as information retrieval (IR), question answering…etc. In the field of IR, the first-stage retrieval often relies on text embeddings to efficiently recall a small set of candidate documents from a large-scale corpus using approximate nearest neighbor search techniques.”

There’s more research by Microsoft that relates to search, but these are the ones that are specifically related to search together with large language models (like GPT-4.5).

Following the trail of breadcrumbs leads directly to Microsoft as the technology powering any search engine that OpenAI is supposed to be planning… if that rumor is true.

Advertisement

5. Is Rumor Meant To Steal Spotlight From Gemini?

The rumor that OpenAI is launching a competing search engine was published on February 14th. The next day on February 15th Google announced the launch of Gemini 1.5, after announcing Gemini Advanced on February 8th.

Is it a coincidence that OpenAI’s announcement completely overshadowed the Gemini announcement the next day? The timing is incredible.

At this point the OpenAI search engine is just a rumor.

Featured Image by Shutterstock/rafapress

Source link

Keep an eye on what we are doing
Be the first to get latest updates and exclusive content straight to your email inbox.
We promise not to spam you. You can unsubscribe at any time.
Invalid email address

SEO

Google Limits News Links In California Over Proposed ‘Link Tax’ Law

Published

on

By

A brown cardboard price tag with a twine string and a black dollar sign symbol, influenced by the Link Tax Law, set against a dark gray background.

Google announced that it plans to reduce access to California news websites for a portion of users in the state.

The decision comes as Google prepares for the potential passage of the California Journalism Preservation Act (CJPA), a bill requiring online platforms like Google to pay news publishers for linking to their content.

What Is The California Journalism Preservation Act?

The CJPA, introduced in the California State Legislature, aims to support local journalism by creating what Google refers to as a “link tax.”

If passed, the Act would force companies like Google to pay media outlets when sending readers to news articles.

However, Google believes this approach needs to be revised and could harm rather than help the news industry.

Advertisement

Jaffer Zaidi, Google’s VP of Global News Partnerships, stated in a blog post:

“It would favor media conglomerates and hedge funds—who’ve been lobbying for this bill—and could use funds from CJPA to continue to buy up local California newspapers, strip them of journalists, and create more ghost papers that operate with a skeleton crew to produce only low-cost, and often low-quality, content.”

Google’s Response

To assess the potential impact of the CJPA on its services, Google is running a test with a percentage of California users.

During this test, Google will remove links to California news websites that the proposed legislation could cover.

Zaidi states:

“To prepare for possible CJPA implications, we are beginning a short-term test for a small percentage of California users. The testing process involves removing links to California news websites, potentially covered by CJPA, to measure the impact of the legislation on our product experience.”

Google Claims Only 2% of Search Queries Are News-Related

Zaidi highlighted peoples’ changing news consumption habits and its effect on Google search queries (emphasis mine):

“It’s well known that people are getting news from sources like short-form videos, topical newsletters, social media, and curated podcasts, and many are avoiding the news entirely. In line with those trends, just 2% of queries on Google Search are news-related.”

Despite the low percentage of news queries, Google wants to continue helping news publishers gain visibility on its platforms.

Advertisement

However, the “CJPA as currently constructed would end these investments,” Zaidi says.

A Call For A Different Approach

In its current form, Google maintains that the CJPA undermines news in California and could leave all parties worse off.

The company urges lawmakers to consider alternative approaches supporting the news industry without harming smaller local outlets.

Google argues that, over the past two decades, it’s done plenty to help news publishers innovate:

“We’ve rolled out Google News Showcase, which operates in 26 countries, including the U.S., and has more than 2,500 participating publications. Through the Google News Initiative we’ve partnered with more than 7,000 news publishers around the world, including 200 news organizations and 6,000 journalists in California alone.”

Zaidi suggested that a healthy news industry in California requires support from the state government and a broad base of private companies.

As the legislative process continues, Google is willing to cooperate with California publishers and lawmakers to explore alternative paths that would allow it to continue linking to news.

Advertisement

Featured Image:Ismael Juan/Shutterstock

Source link

Keep an eye on what we are doing
Be the first to get latest updates and exclusive content straight to your email inbox.
We promise not to spam you. You can unsubscribe at any time.
Invalid email address
Continue Reading

SEO

The Best of Ahrefs’ Digest: March 2024

Published

on

The Best of Ahrefs’ Digest: March 2024

Every week, we share hot SEO news, interesting reads, and new posts in our newsletter, Ahrefs’ Digest.

If you’re not one of our 280,000 subscribers, you’ve missed out on some great reads!

Here’s a quick summary of my personal favorites from the last month:

Best of March 2024

How 16 Companies are Dominating the World’s Google Search Results

Author: Glen Allsopp

tl;dr

Glen’s research reveals that just 16 companies representing 588 brands get 3.5 billion (yes, billion!) monthly clicks from Google.

My takeaway

Glen pointed out some really actionable ideas in this report, such as the fact that many of the brands dominating search are adding mini-author bios.

Advertisement
Example of mini-author bios on The VergeExample of mini-author bios on The Verge

This idea makes so much sense in terms of both UX and E-E-A-T. I’ve already pitched it to the team and we’re going to implement it on our blog.

How Google is Killing Independent Sites Like Ours

Authors: Gisele Navarro, Danny Ashton

tl;dr

Big publications have gotten into the affiliate game, publishing “best of” lists about everything under the sun. And despite often not testing products thoroughly, they’re dominating Google rankings. The result, Gisele and Danny argue, is that genuine review sites suffer and Google is fast losing content diversity.

My takeaway

I have a lot of sympathy for independent sites. Some of them are trying their best, but unfortunately, they’re lumped in with thousands of others who are more than happy to spam.

Estimated search traffic to Danny and Gisele's site fell off a cliff after Google's March updatesEstimated search traffic to Danny and Gisele's site fell off a cliff after Google's March updates
Estimated search traffic to Danny and Gisele’s site fell off a cliff after Google’s March updates 🙁 

I know it’s hard to hear, but the truth is Google benefits more from having big sites in the SERPs than from having diversity. That’s because results from big brands are likely what users actually want. By and large, people would rather shop at Walmart or ALDI than at a local store or farmer’s market.

That said, I agree with most people that Forbes (with its dubious contributor model contributing to scams and poor journalism) should not be rewarded so handsomely.

The Discussion Forums Dominating 10,000 Product Review Search Results

Author: Glen Allsopp

Tl;dr

Glen analyzed 10,000 “product review” keywords and found that:

Advertisement

My takeaway

After Google’s heavy promotion of Reddit from last year’s Core Update, to no one’s surprise, unscrupulous SEOs and marketers have already started spamming Reddit. And as you may know, Reddit’s moderation is done by volunteers, and obviously, they can’t keep up.

I’m not sure how this second-order effect completely escaped the smart minds at Google, but from the outside, it feels like Google has capitulated to some extent.

John Mueller seemingly having too much faith in Reddit...John Mueller seemingly having too much faith in Reddit...

I’m not one to make predictions and I have no idea what will happen next, but I agree with Glen: Google’s results are the worst I’ve seen them. We can only hope Google sorts itself out.

Who Sends Traffic on the Web and How Much? New Research from Datos & SparkToro

Author: Rand Fishkin

tl;dr

63.41% of all U.S. web traffic referrals from the top 170 sites are initiated on Google.com.

Data from SparktoroData from Sparktoro

My takeaway

Despite all of our complaints, Google is still the main platform to acquire traffic from. That’s why we all want Google to sort itself out and do well.

But it would also be a mistake to look at this post and think Google is the only channel you should drive traffic from. As Rand’s later blog post clarifies, “be careful not to ascribe attribution or credit to Google when other investments drove the real value.”

I think many affiliate marketers learned this lesson well from the past few Core Updates: Relying on one single channel to drive all of your traffic is not a good idea. You should be using other platforms to build brand awareness, interest, and demand.

Want more?

Each week, our team handpicks the best SEO and marketing content from around the web for our newsletter. Sign up to get them directly in your inbox.

Advertisement



Source link

Keep an eye on what we are doing
Be the first to get latest updates and exclusive content straight to your email inbox.
We promise not to spam you. You can unsubscribe at any time.
Invalid email address
Continue Reading

SEO

Google Unplugs “Notes on Search” Experiment

Published

on

By

Google unplugs Notes On Search Experiment

Google is shutting down it’s Google Notes Search Labs experiment that allowed users to see and leave notes on Google’s search results and many in the search community aren’t too surprised.

Google Search Notes

Availability of the feature was limited to Android and Apple devices and there was never a clearly defined practical purpose or usefulness of the Notes experiment. Search marketers reaction throughout has consistently been that would become a spam-magnet.

The Search Labs page for the experiment touts it as mode of self-expression, to help other users and as a way for users to collect their own notes within their Google profiles.

The official Notes page in Search Labs has a simple notice:

Notes on Search Ends May 2024

That’s it.

Advertisement

Screenshot Of Notice

Reaction From Search Community

Kevin Indig tweeted his thoughts that anything Google makes with a user generated content aspect was doomed to attract spam.

He tweeted:

“I’m gonna assume Google retires notes because of spam.

It’s crazy how spammy the web has become. Google can’t launch anything UGC without being bombarded.”

Cindy Krum (@Suzzicks) tweeted that it was author Purna Virji (LinkedIn profile) who predicted that it would be shut down once Google received enough data.

She shared:

Advertisement

“It was actually @purnavirji who predicted it when we were at @BarbadosSeo – while I was talking. Everyone agreed that it would be spammed, but she said it would just be a test to collect a certain type of information until they got what they needed, and then it would be retired.”

Purna herself responded with a tweet:

“My personal (non-employer) opinion is that everyone wants all the UGC to train the AI models. Eg Reddit deal also could potentially help with that.”

Google’s Notes for Search seemed destined to never take off, it was met with skepticism and a shrug when it came out and nobody’s really mourning that it’s on the way out, either.

Featured Image by Shutterstock/Jamesbin



Source link

Keep an eye on what we are doing
Be the first to get latest updates and exclusive content straight to your email inbox.
We promise not to spam you. You can unsubscribe at any time.
Invalid email address
Continue Reading

Trending

Follow by Email
RSS