Meta-Generator Documentation

Meta Tag Settings:

Title:
Your sites title.

Author:
Your name or company name.

Copyright:
You copyright notice.

Keywords:
Default keyword list for when extraction is being skipped. You'll probably want to add these to your wanted list below too.

Description:
The description of the site as it should appear on search engines.

Rating:
This is used to give the web page a rating for the appropriateness for kids.
The ratings are, general, mature, restricted, and 14 years.

P3P Compact Policy:
This allows you to add your compact P3P policy to the header information.
More info on P3P can be found here.
Leave blank if unsure!

Robots:
Whether robots should index the page and stop or not index but follow links, etc. index/follow is best.

Pragma:
Turned on, it tells borwser & proxies not to cache your page. Has no effect on search engines.

Bookmark Icon:
IE uses this when somone bookmarks the page to display a cute icon in the browser bar.
Will probably be ../../favicon.ico on any xoops machine.

Extra Header Settings:

Enter any other meta data or javascript you want to appear between your <head> </head>.
DO NOT enter any normal html here, this is only for <meta>, <script>, <style>, etc.

For example, you may want to add your ICRA rating tag in here.
ICRA online generator.

Footer Settings:

Simply replaces the default meta/footer textbox. Use this instead.

Keyword Extractor Settings:

Wanted:
These words are added to the keywords just before shuffling. Like this you can have certain keywords repeated frequently across the entire site. Can also contain 2+ word keywords, and words smaller than 4 letters.

Unwanted:
All these words will be removed from the extracted words as to keep the keywords as relevant to the page in question as possible. No use adding words smaller than 4 letters big here, since they are automaticly stripped anyways.

Note: To add a word, simply fill in the blank box and hit the right/left arrow, to move words between lists, just select the words and use the arrows, and to remove, just select words and hit remove.

Extractor:
This simply turns the dynamic keyword extraction on or off.

Cloaking:
If turned on (recommended), then keywords are not extracted if a normal visitor is detected, thus saving horsepower.

Max. Depth:
How deep you want the extractor to crawl into links. For example in downloads, it could grab all text from xx different downloads present on the page, and compile the keyword list from that.

Max. Words:
Limits the maximum number of keywords to generate.

User Agents:
These are normal browser codes. If HTTP_USER_AGENT does not matches any of these...then we skip extraction since it's not a search engine viewing the page. Cloaking must be turned ON to make usage of this feature.

Import Wordlist:

Browse to any plain text file on your computer containing keywords delimited by spaces, newlines, commas, pipes..and import them into one of the above lists.

Extra:

The below resumes in a brief manner what the keyword extractor does.

If a search engine is detected:

1. Gather as much info about the current page as possible: Title, Description, etc.
2. Place all gathered info into a single string.
3. Remove bad characters, and too small words.
4. Remove any common words.
5. Add any wanted words.
6. Shuffle the words to get a random order.
7. Return max. no. of wanted keywords.
8. Uppercase all 1st letters in words and place words in meta-keywords.
9. Depending on where we are, try and place some extra info in the site title.

We uppercase because a search for:
"xoops" will find "xoops, Xoops, XOOPS"
"Xoops" will find "Xoops" and eventually "XOOPS"

So if a user searches in lowercase he'll still find your word, but if he uppercases his 1st letter you loose... Like this we catch both users that may uppercase the 1st letter of a word AND those that search in lowercase.


If a normal user is detected (no search engine) & cloaking is turned ON, then we skip all the above and display normal keywords, thus saving cpu/mysql consumation.


Extra Links:
http://www.submitcorner.com
http://www.wordtracker.com
http://searchengineforums.com
http://searchenginewatch.internet.com

Not a complete list, but should help you understand whats going on here :o)