Web Sites | Web Squads | Web Tools | Web Hooks | destination-e


home | search & site map


www.thinking.com.au

Search Information
  
1. What is a search engine?

Search engines utilise indexing software agents often called robots or spiders. These agents are programmed to constantly "crawl" the Web in search of new or updated pages.

When visiting a Web site, an agent will record the full text of every page (home and sub-pages) within the site. It will then continue on to visit all external links.

Robots will then revisit your site periodically to refresh the information they already have in their database.

top


2. How do you get listed in a Search Engine

Each search engine looks at different elements of your page, and some or all are important to each search engine.

a. Use keywords in the <TITLE> of your document making it as descriptive as possible. When visiting your site, an agent will go first to the <TITLE> tag, (when you open a browser it displays the title in the top bar). Search engines will display the text located between the <TITLE> tags when your web page is listed in a search. By making your <TITLE> descriptive, you'll be better off than those who only have keywords within the text of their page. It will also be helpful when people bookmark your web site. If a more descriptive name appears in a person's hotlist, it will be easier to find your site at a later date.

For example, instead of using <TITLE>Thinking </TITLE> as the title of Thinking's home page, <TITLE>Thinking Internet Management</TITLE> would be much more descriptive. It would also place greater emphasis or relevancy on "Internet Management" when calculating keywords.

The HTML title is the link name of the page as it appears on lists of results. "No Title" doesn't really attract people to click to go to your page. And having multiple pages with the same title is confusing and looks unprofessional.

In the ranking rules that determine which pages will appear near the top of a list of matches, the HTML title is the most important element of the page.

When people search for your kind of information, they are most likely to use those words that belong in your HTML title. When you leave the title blank, use unimportant words, or use the same title on many different pages, you've just lost an opportunity of attracting people to you site.

b. Descriptive Page Text Search engines assign greater relevancy to text located at the top of a page than to text located in the middle or at the bottom of the page. The search engines assume that web page authors will present their most important information first. If your page has a main graphic at the top, you should place some descriptive text either underneath or beside the image. The search engines will index this text and assign it a high level of relevancy.

c. Use <META> tags which allow you to provide even more detail about your Web pages and thereby gain greater control over how your pages are indexed. Not all search engines make use of <META> tags, but adding these tags to your pages will make them more accessible to the search engines that do.

<META> tag codes are inserted within the <HEAD> </HEAD> tags. The basic syntax is:

<META name="description" content="We use search engine optimisation as a weapon, not merely a tool">

This will control what appears as the summary of your Web page and will be displayed after the title of your document in the index listing. The content of the description should clearly convey what one can expect to find when linking to your site.

<META name="keywords" content="search engine optimisation, keywords, search engine ranking, listings, thinking, internet management, internet marketing tool, search squad, specialists ">

This will allow you to provide extra information about your page to the search engines without it being visible to the reader. While search engines do take these keywords into account when indexing your page, they are still going to index the entire contents of your page as many sites do not include <META> tags.

Beware: do not, excessively repeat keywords in a keyword <META> tag, as search engines may penalise you for this. InfoSeek and Lycos are two such examples and others may adopt similar policies in the future. The penalty will most likely be the spider disregarding the <META> tag and extracting keywords from the content of your page.

d. If your site uses frames, you should be aware that search engines treat frames as if they are links within your main page. As a result the engines will review and index your main page and, at a later date, return to index each individual frame just as it will return to index all other internal links within your web site. Therefore, in order to have your main page  indexed accurately and efficiently, we recommend that you add some descriptive text between the <noframes> and </noframes> tags of the HTML source coding of your main page. The noframes tags are usually placed below your frame set information. The frame set information is designated by <frameset> and </frameset>. This text should include your most important keywords and keyword phrases. Adding this text will provide the search engines with content from which to derive keywords for indexing. After this change has been made to your Web site, the page itself will appear exactly the same to anyone using a browser that supports frames. However, users of browsers that do not support frames (i.e. Netscape 1.0 or lower) will now be able to successfully view your home page.

top


3. Indexing Characteristics of Individual Search Engines

Excite
At the present time, Excite does not make use of <META> tags. Since keywords and summaries are automatically generated by Excite, you have less control over their creation. However, there are still a few things you can do.

Excite's software looks for common words or themes within a page. It then selects sentences for the summary that either contain these words or convey the overall theme. The words within these sentences are also used as keywords for which the site can be searched.

  1. Especially at the beginning of your page, be as concise as possible and limit non-descriptive sentences. If the Excite robot comes across a number of ambiguous phrases, it will have to look deeper and deeper into your site to determine its theme and site summary. Along the same line, too little text will also force the robot to travel further into the site for more information in order to establish a theme.
  2. Excite's indexing software places preference on complete, punctuated sentences. If you have content, such as a quote, at the top of your page that you do not want Excite to include in your site summary, do not display it as a complete sentence. This will lessen the chance that the quote will be included, but will not guarantee its exclusion.
HotBot / Inktomi
  1. HotBot supports both the keywords and description <META> tags.
  2. If you strongly believe that your site was not ranked as high as you thought it deserved in a search query, HotBot allows you to send them an email to bugs@hotbot.com. Be sure to include the URL of the search page.
InfoSeek
  1. InfoSeek supports both the keywords and description <META> tags. Your description can include up to 200 characters of text and the keywords can include up to 1000 characters of text. Do not repeat versions of a keyword more than seven times. If you do, InfoSeek will disregard the entire keyword list.
  2. If you do not make use of the description <META> tag, InfoSeek's agent will simply insert the first 200 characters after the <BODY> tag as the web page description. Hence, if your Web document does not contain <META> tags, at least try to make your first 200 words accurately describe your Web site.
  3. InfoSeek also indexes the ALT attribute in the <IMG> tag. If the majority of your home page consists of graphics, you can describe your page with the ALT attribute.

The syntax for an <IMG> tag code is:

<IMG SRC="/images/clinton.gif" ALT="picture of President Clinton">

InfoSeek Ultra
InfoSeek Ultra will make use of <META> tags allowing you to control the description that appears in a search result as well as guide its web indexing in the selection of your site's keywords. If you do not make use of <META> tags, Ultra will simply use the first few words it comes across as your Web site summary.

Lycos
Lycos creates Web site titles and descriptions from the text of your Web page. Lycos' search agent selects a portion of the site that well represents its theme. It then displays this section as the site's description.

The keywords are also chosen via artificial intelligence by Lycos' spider. With this in mind, do not open your page with an image map. If you do, Lycos will not be able to take an abstract for your document.

WebCrawler
WebCrawler relies on the statement within the <TITLE> tag to use for the name of your page. While other search engines will derive a summary from the <BODY> text of the document, Web Crawler will default to the URL if you fail to include a title.

In conclusion, do not make it your goal to appear in the top ten list of every search engine. This would not only be a very difficult task, but would most likely end in disappointment. You are virtually guaranteed to have varying success rates with different search engines due to the collection of variables that play a role in every search result. These variables include size of database, method used for determining relevancy, policy on spamming, use of <META> tags and more. Hence, optimise your listings by capitalizing on the indexing criteria shared by search engines. If you have a favorite or preferred search engine that you feel strongly about being yielded as high as possible in a search, customise your Web pages accordingly. Just keep in mind that this might lessen your perceived relevancy on another.

top


4. How long does it take a Search Engine to list my site?

Approximate times:
1-2 weeks: Altavista, Infoseek
2-4 weeks: Excite, HotBot, Lycos, Webcrawler
6-8 weeks: Yahoo

If after the time listed in the chart above has elapsed and you are still not finding your site listed, you should re-submit your URL to the search engines that do not have your listing. It often takes more than one submission to get the best results.

Search engines receive tens of thousands of submissions every day. On days when the submission level is above normal, a certain number of submissions can be received properly but still not get listed. If your listing is not appearing on a certain search engine, you should resubmit it.

The following links take you to the Submit pages of the top 10 search engines used by Australians.

Altavista
Excite
Lycos
MSNSearch
Looksmart
Google Search
NineMSN
WebWombat
Yahoo

top


5. Should I submit inside pages of my site?

When considering how many pages of your site to promote, it is important to remember directories generally accept one listing per company. This is usually your home or main page. Search engines index multiple pages within a site, but you cannot control which pages they will index unless you submit the individual URLs (pages).

The critical pages to submit are your major topic pages, pages with unique content, or pages that describe a specific product or service. For example, a sports store will have separate pages on basketball, baseball and football which should all be submitted. Another benefit to submitting multiple pages in this example is that the basketball page will appear higher in a search for "basketball" than the sports site home page. These pages should also include descriptive title tags and meta tags. In order for site visitors to easily navigate your site, these major topic pages should provide clear links to other topic pages and your home page.

top


6. When will I need to inform search engines and directories of updates to my Web pages?

Here are some general rules and indicators for knowing when you need to make updates to your listings. For more detailed information on the process involved in making updates to each directory, consult the Submission Hints and Tips accessible within the Submit It! Gold and Pro versions.

For search engines: Make updates to listing:

  • If your URL changes, you will need to resubmit. Some search engines have Dead Link forms for you to fill out. Those that do not will drop the old URL from its records the next time it tries to visit your site at the old address and is unable to find it.
  • If the content of your site changes so extensively that the abstract no longer adequately describes your site. These changes, however, will be taken into account when the indexing agent next visits your site to refresh its information. Resubmitting will hasten this process.
For directories: Make updates to listing:
  • If your Web site address (URL), description, or category classification changes, you will need to inform the webmaster of that directory.
  • If your category classification has changed and you are now inaccurately located within the directory's index.
Do not make updates to listing:
  • If there has been minor changes made within the site which do not alter the site's description. If the content or purpose of the site remains the same, you are not misleading anyone who chooses to visit your site based on the original description.

top


7. "Non-Netiquette" Things

There are several "tricky" things you can do to try to get your URL listed higher or more times on a search engine results page. We are listing these alternatives, but are in no way recommending them as these Web site promotion methods may be viewed as "cheating" by some members of the Internet community. In fact, some of the search engines are starting to penalise people that use these tricks. The reason why we list these tricks is to help you understand why some sites always show up multiple times or always at the top.

  • Placing mass amounts of hidden keywords <!software manufacturer, software manufacturer,...> at the bottom of your document. Search engines calculate keywords by how many times they appear on a page. It is important to be aware that search engines may penalise you for excessively repeating keywords. At present, InfoSeek and Lycos are two examples and others may adopt similar policies in the future.
  • Changing your name to A1 Enterprises or !Rob's Restaurant in order to appear at the top. Our suggestion is that you pick something descriptive instead. You might appear at the top by changing your name, but be skipped over because the new one does not evoke a professional or trustworthy company image.
  • Changing your <TITLE> regularly so when robots revisit your site to refresh their information, they will interpret the new title to mean the existence of a new site. The result being your page will be listed more than once in a search.

top


8. Query Relevance

There are hundreds of millions of Web pages, so almost any query is likely to have a huge number of matches. For search results to be useful, search engines must rank more highly pages that are most likely to have relevant information. Search Engines formula for doing that is a closely kept secret (like the formula for Coca-Cola), and is subject to continuous fine-tuning. But an understanding of the main ingredients can help you build pages that will be valued by search engines and hence found by people who use them.

Content counts; content near the top of a page counts more than content at the end. In particular, the HTML title and the first few lines of text are the most important part of your pages. If the words and phrases that match a query happen to appear in the HTML title or first lines of text of one of your pages, chances are very good that that page will appear high in the list of search results.

Say you want to put your resume on the Web. Keep this rule in mind: Don't put your name first. You aren't trying to be found by people who already know you. You want to be found by people who have never heard of you. So don't waste any letter in the HTML title on your own name. The first word should be "resume." After that, list your main qualifications and the kinds of jobs that you are looking for. Put the same kinds of things in the first lines of text. That's what will come up as the default as the description in match list, and it's also an important position for ranking.

top


9. Spamming

Some barriers to being indexed are due to the misbehavior of a handful of webmasters who have tried to fool search engines into ranking their pages high on lists of matches and including them as matches to queries they aren't appropriate for. This is one kind of behavior that is known as "spamming." Spamming degrades the value of the index and is a nuisance for all.

The logic that leads people to try such tricks is rather bizarre. "I figure everybody searches for the word 'sex.' I don't have any sex at my site, but I want people to stumble across my site. So I'm going to put the word 'sex,' three thousand times as comments. And any time that anybody searches for 'sex,' my pages will show up first."

People have actually tried that. They have tried doing the same kind of thing in the backgrounds of their Web pages. They have also created page after page of text that is in the same color as the background color so visitors won't see the words, but search engine crawlers will. They have tried everything imaginable to fool search engines.

If being found via search engines is important to your business, be very careful about where you have your pages hosted. If the hosting service also hosts spammers and pornographers, you could wind up being penalised or excluded simply because the underlying IP address for that service is the same for all the virtual domains it includes.

top

Search Squad
Topics

  Search Optimisation
  Search Information
  Clients
  Degrees of Difficulty
  Test Your Website
  

Web Squad
Topics

  00 web squads
  01 think squad
  02 search squad
  03 content squad
  04 report squad
  05 audience squad
  06 email squad
  07 site squad
  

   

the company about us
opinion
newsletter
the work portfolio
service
tools
the site home
search & site map
pay online ssl payment form

  
Privacy Statement

www.thinking.com.au
  
Thinking Australia Pty Ltd.
All rights reserved.
© 2003

Thinking is an Australian Government Endorsed Supplier