21 Essential Latent Semantic Indexing Definitions
Posted by Bruce in LSI Articles, tags: latent semantic indexing, search engine optimiation, seoDefinitions:
The following definitions are the most common ones found in articles written about Latent Semantic Indexing.If your goal is to rank higher in today’s search engine environment, it is important to understand the new concepts being used.This is a “work in progress” as these techniques are continually evolving.
Latent Semantic Indexing: is the latest attempt by Google, and other search engines, to rank websites based on a more natural, or human, approach. Instead of rating sites using keyword density and links, more weight is now being given to sites that are based around a central theme.
Latent: that which is present but cannot be seen.
Semantic: relating to the different meanings of words.
Analysis: an investigation to determine essential features and relationships.
LSI: the abbreviation often seen for Latent Semantic Indexing
Importance: LSI is here to stay so new optimization techniques must be used which conform to this new technology if higher search engine rankings are to be obtained.
Keywords and Themes:
Keyword Stuffing: The practice of using one, or more, keywords over-and-over to trick the search engines into rewarding the website with a higher ranking.
For example: The man walked his dog to the dog park where he saw many other dogs. The overuse of the word “dog” is not natural and is only intended to help the site rank higher for the keyword “dog”.
Keyword Density: the percentage a particular keyword is used in relation to all the words used.
For example: in the sentence above, the word “dog” is used three times out of a total number of fifteen words. The keyword density for the sentence is: 3/15, or 20%.
Themed website: a website built around a central theme using many inter-related keywords.
For example: a themed website about barbecues might be expected to also contain such words as “gas”,” patio”, “starter fluid”, “charcoal”, and “ribs”.
Theme bleeding: a website which contains content not related to the website’s central theme.
For example: a website about travel may contain articles about different countries which, in effect, distract from the website’s core theme.
Importance: old optimization techniques of keyword stuffing and striving for a specific keyword density are no longer useful. Today a new approach is needed–websites must be constructed around themes instead of around individual keywords.
Word Terminology:
Synonym: a word which has the same, or almost exactly the same meaning, as another word.
For example: car and automobile.
Polysemy: a word or phrase which has two or more separate meanings.
For example: “bank” could be a financial institution or a river bank, depending on the context it is used in.
Lexical Database: a database in which words are grouped into sets that relate to a distinct concept. WordNet has a large, free, English lexical database.
Importance: search engines are becoming smarter with the new LSI technology. At one time it was difficult for a search engine to distinguish between words with the same spelling but different meanings–a polysemy–but those days are over. Today websites must be built using a variety of keywords–synonyms, plurals, different tenses of verbs–anything that helps develop a central theme. The use of lexical databases and other tools can help collect these necessary terms and phrases.
Links:
Inbound Links: a hyperlink on a separate website which points to your site.
Outbound links: a hyperlink on your website which points to a different website.
Reciprocal Linking: the mutual exchange of links between two websites. In the case of reciprocal linking your site would have both an inbound link from, and an outbound link to, a particular website.
Anchor Text: the clickable text in a hyperlink. If the anchor text is clicked on it will take you to a new site.
For example: instead of a hyperlink in the form of
http://www.LatentSemanticIndexing.com
the anchor text could simply be: LSI
Importance: The days of reciprocal linking are over. It is now imperative to develop inbound links to a number of your site’s internal pages while using a variety of different anchor text.
Search Methodology:
Algorithm: a fixed list of distinct instructions to follow; a formula. The search engines use algorithms to determine which web pages they will return for a search made on a particular keyword or keyword phrase.
Boolean: a search method used by search engines which uses three logical operators, “or”, “and”, and “not”.
For example: there could be a search for “dog”, “dog and cat”, or “dog or cat”. This type of search does will return websites based on keywords, not themes.
Taxonomy: the science or technique of classification and categorization.
Dynamic Taxonomy: the process for searching and retrieving information from large, diverse, databases.
Ontology: a rigorous and exhaustive organization of a particular field of knowledge that is usually hierarchical and contains all the relevant entities and their relations.
Importance: these terms are mainly for reference and may be useful for understanding articles about search engine methodology.

Entries (RSS)
This is interesting. Is there any site or information about the list of synonym or related to keywords that LSI will use? For example, for the old SEO method, I want to target “Human resources outsourcing”, so for LSI, which should be a groups of words that will help search engine understand that my site is related to “human resources outsourcing”?
Google actually shows us what keywords are related and they are shown in order of relevance! Just go to the Google Keyword Tool, enter your keyword phrase, and then look at the results. On the left side it will say: “Keywords related to term(s) entered-sorted by relevance”. These are the ones I would use scattered throughout the site.
Bruce