The Search Engine Professionals at Rank for $ales.com --- In business since 1997.
Back to our Homepage SEO Tips that will make a big difference in your rankings and our most popular ** How To ** section The most common myths about SEO -- Read what the experts have to say about today's most common SEO myths and misconceptions Frequently Asked Questions to Search Engine Optimization and Positioning Search Engine Optimization Industry News -- Stay in tune with the most recent developments in search engine technology and the SEO industry Contact Rank for $ales today and get your site's rankings high in the engines-- Right where they should be!

  SEARCH FOR:   CITY or STATE:

Search this site


Search tools need to be refined

August 9, 2004

Even if search engines have vastly improved access to information, search tools need to be refined to make it easier to cull information or conduct queries.

That was the word from researchers and speakers at the New Paradigms for Using Computers Conference, held at IBM's Almaden research lab here last week.

"We live in a world with lots of information but also lots of interruptions. It is a teriyaki of information. The question is, 'How do we survive in the marinade?'" joked Dan Russell, senior manager of user sciences and experience research at IBM Almaden.

Early attempts to better locate the world's information are already under way. The University of California at Berkeley, for example, showed off at the conference a prototype of a search engine called Flamenco that makes it easier to search for works of art or antiques. Santa Clara, Calif.-based Inxight, meanwhile, has created software that attempts to graphically represent latent connections between people or institutions by studying where and how they get mentioned on the Web.

On the desktop, companies such as Ingenuity Software, founded by former Apple Computer developer Bruce Horn, are creating tools designed to make it easier for people to index their photos and documents for subsequent Google-like searches on their hard drive.

These research efforts are in addition to new operating systems under development that will include better search tools.

Microsoft plans to add better search features to a future version of Windows, code-named Longhorn, due sometime around 2006 or 2007. The software giant last week demonstrated a more general Web search "service" that's also in development.

And Apple's Tiger, a new version of the company's Mac OS X operating system that's due next year, will include a new systemwide search engine called Spotlight that will allow Mac users to quickly search and find any file, Apple says.

One of the surprises that has emerged from the Internet Archive, which is intended to become a repository of everything ever published, is that the body of public works can probably be corralled, said Brewster Kahle, founder of the organization.

About 100 million different books have been published in history, Kahle said, citing estimates from professor Raj Reddy at Carnegie Mellon University. About 28 million sit in the Library of Congress. On average, a book can be condensed to a megabyte in Microsoft Word. Thus, the books in the Library of Congress could fit into a 28-terabyte storage system.

"For the cost of a house, you could have the Library of Congress," Reddy said, adding that mass book-scanning projects are currently under way in India and China.

"Universal access to all human knowledge is within our grasp. It could be one of the greatest achievements of all time." -- Brewster Kahle, founder, Internet Archive.

Only about 2 million to 3 million audio recordings--mostly music--have ever been published for public consumption. The Internet Archive has begun to store digitized recordings of concerts as well and has about 15,000 shows in its database to date. There are between 100,000 to 200,000 theatrical movies--half of them from India--in existence and about 20 terabytes of TV broadcasts a month. The Web grows by about 20 terabytes of compressed data a month as well. (One terabyte equals 1 trillion bytes.) Since 1984, about 50,000 software titles, including CD-ROMs, have emerged.

Though the legal issues around storing and viewing all this information remain thorny, storing it is doable.

"Universal access to all human knowledge is within our grasp," Kahle said. "It could be one of the greatest achievements of all time."

Still, that's a lot to grasp. Similarly, individuals will experience an explosion in their personal catalogs of data. In the MyLifeBits project under way at Microsoft Research, noted scientist Gordon Bell is attempting to digitally capture all of the books, movies, TV shows, music and other media he has experienced in his life. He's up to 44GB of data so far.

E-mails, phone messages, photographs and personal video will also add to an individual's data trove. In another experiment, doctors in Cambridge, England, have equipped patients suffering from severe memory loss with a Microsoft SenseCam, a wearable camera that takes pictures when a person moves. One man is currently using it so he can show his wife, who has memory problems, a diary of the day, said Ken Wood, who works on the project.

Microsoft has also entered a three-year alliance with the Edinburgh International Festival in Scotland. In a likely experiment, attendees will wander about the arts fest with SenseCams around their necks, snapping shots.

One approach to mastering data overload lies in developing search engines specialized for certain topics and data sets. That's the tack taken by Berkeley's Flamenco project.

In Flamenco, a Yahoo-like interface categorizes artworks drawn from museum collections around the world by content (animals, heaven and earth, shapes and colors, and so on), century, artist, medium (such as painting, furniture, sculpture) and other identifiers. By going up and down the tree, users can browse through all the animal pictures found in the database, or they can zero in on, say, the years 1700 to 1709 and discover that the period, at least as represented by the database, produced only four paintings of hoofed mammals.

The search engine does not search on the visual information contained in the picture, said Kevil Li, a student on the project. Instead, searches are conducted on descriptive text submitted by the museums that digitize their artwork for such databases.

Other tools, such as Inxight and GeoFusion, produce graphical representations of data obtained through searches. GeoFusion, which makes software that can extrapolate from geographic data, was able to render a map of the movements of a tagged tuna.

By contrast, Inxight's software creates a map of relationships between names and topics. A search on the White House and business showed that Haliburton is the corporation linked most often to the White House. In a similar fashion, IBM's own WebFountain project is used to test how cohesive certain blogging communities are by how quickly and in unison they react to news events.

File systems will likely begin to disappear as search gains popularity. One of the phenomena that Microsoft researchers are finding in MyLifeBits is that files are largely ad hoc categories that become outdated, said Jim Gemmell at Microsoft Research.

Instead, data should be tagged so that if people remember a name or part of a name, they can find their way back to documents or pictures involving that person, or they can find documents created on the same day that they had a phone conversation with the person, even if the discussion involved something unrelated.

"The problem is not that we keep too much with MyLifeBits. The problem is how to use it," Gemmell said.

Poorer nations will also be able to take advantage of these advances, even without an electrical grid. The Internet Archive has created mobile bookmobiles in conjunction with Hewlett-Packard and others. The bookmobiles contain a printer hooked up to a satellite feed, which can print books for kids. Two are in operation in India, while another in rural Uganda prints about 1,500 books a week. The entire bookmobile, including the cost of the used van, is $15,000, and 100-page books cost about a $1 to print and bind in the van.

"It takes about 12 to 15 minutes to make a book," he said. "It is cheaper for a library in the United States to print and give away a book than retrieve it."

Source: C-Net News


Back to the top of the page.         
Drop your e-mail address
& get our free weekly newsletter

Read Serge Thibodeau's daily blogs on search engines at Serge Thibodeau Live. We strongly suggest you bookmark our web site by clicking here.

Tired of receiving unwanted spam in your in box? Then get SpamArrest™ and put a stop to all that nonsense. Click here to get all the details.
Tired of receiving unwanted spam in your in box? Get SpamArrest™ and put a stop to all that SPAM. Click here and get rid of SPAM forever!

Get your business or company listed in the Global Business Listing directory and increase your business. It takes less then 24 hours to get a premium listing in the most powerful business search engine there is. Click here to find out all about it.

Rank for $ales strongly recommends the use of WordTracker to effectively identify all your right industry keywords. Accurate identification of the right keywords and key phrases used in your industry is the first basic step in any serious search engine optimization program. Click here to start your keyword and key phrase research.

Pay Rank for $ales securely with your Visa, MasterCard, Discover, or American Express credit card through the secure PayPal network. (Note: PayPal is an eBay company, and maintains a net free capital of US $ 50 Million).
VisaMasterCardDiscoverAmerican Express

You can link to the Rank for Sales web site as much as you like. Read our section on how your company can participate in our reciprocal link exchange program and increase your rankings in all the major search engines such as Google, AltaVista, Yahoo and all the others.

Powered by Sun Hosting                  Sponsored by Avantex          Traffic stats by Site Clicks™

Site design by Mtl. Web D.         Sponsored by Press Broadcast         Sponsored by Blog Hosting.ca


Call Rank for Sales toll free from anywhere in the US or Canada:   1-800-631-3221
email:   info@rankforsales.com

| Home | SEO Tips | SEO Myths | FAQ | SEO News | Articles | Sitemap | Contact |


Copyright © Rank for Sales 2003    Terms of use    Privacy agreement    Legal disclaimer

       Ce site est disponible en Français