SEO

Bad words list (458 words)

62

Are you creating a new forum, IRC channel, or pretty much any kind of website where users can submit links and you need to control bad words?
Well I was in that situation a few days ago, and it took a very long time until I managed to find some resources, so here I'll try to post a bad words list that I'll keep on building on as I find more sources.

I won't post the bad words list as text, as some public might find it offensive. Instead I'll just post a link to a zip file with the badwords.

Download:

The zip file contains a plain text file with one word per line, and a MySQL file to populate a database with id, word, and replacement.

Change log:

  • Added more words to the list and improved the MySQL one. The list now contains 458 unique words, sorted in alphabetical order.
  • Public release: v. 1.0 - A standard bad words list in a txt file, containing 252 words, plus an SQL script that populates a MySQL database with badwords and their respective "replacement" euphemism.

If you know of any good bad words list please comment its URL, so I can add them to this list!

Best site evaluating tools

0

I guess I'm enjoying writing "Best..." compilations, and well, it is actually a great way to have all your favorite websites all in one place!

In this list I'll list the best site evaluation tools, either SEO analysis, design accessibility issues...
If you know of any great tool that I missed out, please make sure to comment, it's free!
webite value calculator, blog juice calculator, xinu, website grader, crawl tester, w3c

Blog Juice Calculator:

Probably one of the best blog rating online services out there. Learn how much "juice" your site has using this great tool. It checks almost everything that could be possible checked in a blog, and it advices you on how to improve it.

Blog Juice

Website grader:

Another great tool, similar to blog juice but for non-blogs too. It provides you with a grado from 0-100, depending on "over 50 variables". Always good to know if you passed right?

Website grader

Crawl test

Probably one of the best SEO tools here, it crawls your site and it gives you complete feedback on every single page crawled. You need to register though, and the free version limits the number of crawled pages to 5, but still worth using!

SEOmoz

Xinu - Website Analysis

Another SEO evaluation tool, it displays various information about your site collected from all over the web. Very useful tool to know where you must strengthen your efforts!

Xinu

Website value calculator:

This is more of an entertainment tool than an actual SEO tool, but still, it is cool to check... Right now it says I could sell this blog for 86$... :D

Website value calculator

The grand W3C validation services:

Probably the best tool of all when trying to improve accessibility is this one. There are two validators, the html validator and the css validator. (The image points to the html one)

w3c validator (html)

If you know any more great tools that I missed out, please comment!

Writing a good sitemap.xml file for google

3

A sample sitemap.xml fileThe Google XML allows you to inform search engines about URLs on your websites that are available for crawling. Simply, a Sitemap that uses the Sitemap Protocol is an XML file that lists URLs for a site. It also allows web developers to include additional information about each URL ( last update; how often it changes; how important it is related to other URLs in the site) so that search engines can more intelligently crawl the site. This way search engines can crawl your website better.

Google now uses Sitemap Protocol 0.9 as dictated by sitemaps.org. Sitemaps created for Google using Sitemap Protocol 0.9 are compatible with other search engines.

A Sitemap must begin with an opening urlset tag and end with a closing urlset tag. and include a url entry for each URL as a parent XML tag, and a loc child entry for each url parent tag. It must be UTF-8 encoded.

In a Sitemap, urlset, url and loc are required, but the changefreq, lastmod, and priority are optional. urlset is the header and url tag is used before each urls, loc is where the files are located and it has to start with http:// and end with a slash "/", lastmod means when was the last time this URL was modified and the date there must be written according to W3C Date-Time format (YYYY-MM-DD) hour, minutes and seconds can be added too, changefreq means how often this URL is changed (always, hourly, daily, weekly, monthly, yearly and never), and priority means whats the priority of the URL related to the other URLs in this location, the value must be between 0 and 1, so we use a value like 0.8, the default is the 0.5, and changing the value to 1 does not effect the web site's position on Google, just the priority of this URL among other URLs in this location folder.

An Example XML for http://urbanoalvarez.es domain

 
< ?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
 
<url>
<loc> http://urbanoalvarez.es/blog </loc>
<lastmod>2008-03-12</lastmod>
<changefreq>weekly</changefreq>
<priority>1</priority>
</url>
 
<url>
<loc>http://urbanoalvarez.es/portfolio.php</loc>
<changefreq>monthly</changefreq>
</url>
 
<url>
<loc>http://urbanoalvarez.es</loc>
<lastmod>2008-03-12</lastmod>
</url>
 
</urlset>
 

As we see in the example, all we need to do is to use the required tags, which are urlset, url and loc, the others are optional, but of course for a better solution I recommend you to use all of them, make a hierarchy between your website's each URL, insert their priority values according to that and insert their last modified date time and change frequency as told above.

After completing the Sitemap file, I recommend you to compress it with a gzip, and don't forget the Sitemap file cannot contain more than 50.000 URLs and it cannot be larger than 10MB size, so you better compress it. Then it will be a sitemap.gz file. ( Or how you name it, the file extension will be .gz after gzip)

Now where to put the Sitemap file in the site folder ? Don't forget, wherever you insert the sitemap.gz, it can include the URLs in the same folder or in the sub folder. You can't include an URL which is not in the same directory or in an upper directory. With an example, if you insert the sitemap file in the directory, "http://frihost.com/sitemap.gz" we can include all the URLs starting with "http://frihost.com/". But if we insert the Sitemap file in a directory like " http://frihost.com/tools/sitemap.gz", then we can include only URLs which starting with the "http://frihost.com/tools/". So you better insert it in the root, I think it's clear enough.

Now it's time to validate our Sitemap file. Google uses an XML schema to define the elements and attributes that can appear in your Sitemap file. You can download this schema from the link below:

sitemap.xsd

There are some tools to help you validate the structure of your Sitemap. You can find a list of XML-related tools at each of the following locations:

To validate your Sitemap you will need this header in the XML file you have created. It is something like this.

< ?xml version='1.0' encoding='UTF-8'?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://www.sitemaps.org/schemas/sitemap/0.9
http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd">
</urlset>
<urlset>
<url> ....
<loc>....</loc>
</url>
</urlset>

If you are using a Sitemap Generator, it probably does this part for you.

Our Sitemap File is ready, in a shorter time Google and other Search Bots will crawl our site, easily, regularly and more often , and our site will get more traffic. It's what we aimed when we started to make a Sitemap.
If you want to make sure that Google crawls your pages, create an account in Google Webmaster Tools, and submit your sitemaps manually.
Original post by paskall, in frihost.com

Cheers

Go to Top