GCN Tech Blog

By GCN Staff

Blog archive

Programming the search engine

As we reported awhile back, Google should introduce a new technology later this year, called the Programmable Search Engine, that could significantly change how Internet search is done.

Reading over some of the patents around the PSE Google was recently issued offers a glimpse into this new world.

What Googzilla wants to do, in a nutshell, is bring a bit more precision to search when it comes to vertical sites. 'Vertical site' is a marketing term for any specialty Web site with a lot of material, such as Amazon.com, or the Environmental Protection Agency or, for that matter, Government Computer News.

These days, such sites can offer a wealth of information on a particular set of topics, though in most cases, the search engines these sites offer are still pretty sketchy (ours included). You can find what you are searching for'and a little more.

For instance, if you go to IRS.gov to search for a Form 1040x you may need, you will get back a link to a PDF version of the form, as well as a number of other links which may or may not be useful to you.

Google wants to tighten that up. Imagine, in the future, if you look for the 1040x, you get back a list of options that clarify what you are truly seeking. Do you want to download the form? Do you want instructions on how to fill out the form? And so on. In fact the site may already have a pretty good idea of what you need, based on your history of surfing and preferences submitted by the site itself.

'When the user issues a query associated with a particular context,
either by clicking on a particular link or performing the query from a
particular vertical content site, he or she is effectively communicating
a piece of information about his or her current activity and/or about
him- or herself,' the PSE patent reads.

What happens in the background for these types of searches is that the organization's Web site hands the query off to Google, along with a file that may offer some insight about what users really want when they submit searches for terms like 'Form 1040x.' Google may have its own files that could help further clarify the context of the query as well.

'The pre-processing operations include operations to revise, modify or expand the query, to select one or more document collections on which to
conduct the search, to set various search algorithm parameters for
evaluating the query, or any other type of operation that can refine,
improve, or otherwise enhance the quality of the user's search query," the patent again.

Of course, this service will require some work on the part of the vertical site manager to build these files. And as agency Web site traffic increasingly comes from Web sites (Google, mostly), managers should consider the full implications of having your information so finely parsed and presented to the outside world, as other 'vertical' fields of endeavor are starting to do.

Posted by Joab Jackson on May 30, 2007 at 9:39 AM


Featured

  • Defense
    Ryan D. McCarthy being sworn in as Army Secretary Oct. 10, 2019. (Photo credit: Sgt. Dana Clarke/U.S. Army)

    Army wants to spend nearly $1B on cloud, data by 2025

    Army Secretary Ryan McCarthy said lack of funding or a potential delay in the JEDI cloud bid "strikes to the heart of our concern."

  • Congress
    Rep. Jim Langevin (D-R.I.) at the Hack the Capitol conference Sept. 20, 2018

    Jim Langevin's view from the Hill

    As chairman of of the Intelligence and Emerging Threats and Capabilities subcommittee of the House Armed Services Committe and a member of the House Homeland Security Committee, Rhode Island Democrat Jim Langevin is one of the most influential voices on cybersecurity in Congress.

Stay Connected

Sign up for our newsletter.

I agree to this site's Privacy Policy.