Traffic building
Online radio – Setting it up & hosting it
0Hi guys,
I'm sure someday we will work for someone who wants an online radio statio, well, this is a quick guide to those
Online radios:
Basically it is audio being broadcasted via the Internet. So that means that we generate an audio stream that listeners cannot control, other than pausing and resuming.
To launch one we need several things, to start off, we need a server with streaming capabilities. Since setting up a streaming server might be difficult, we will use a standard radio management program. The main ones are SHOUTcast, IceCast, Live365...
I will focus on SHOUTcast, since it is the most extended among web hosts.
Radio server: SHOUTcast
First off, we need a server to host our radio. Radio servers are not the same as standard web hosting servers, since they require a static IP for the radio, and streaming capacity. Most radio servers use SHOUTcast to manage the radio, and I personally recommend it for its easy of use.
From the SHOUTcast documentation:
"The magic of the SHOUTcast Radio system happens inside the SHOUTcast Radio Distributed Network Audio Software (DNAS). This software runs on a server attached to your IP network with lots of bandwidth, and is responsible for receiving audio from a broadcast source, updating the SHOUTcast Radio directory with information about what the broadcaster is sending, and sending the broadcast feed to listeners"
Finding a good hosting solution:
There are plenty of radio hostings out there, and finding the best one is almost impossible. There are some key things that we must always take into account:
- Bitrate: This means bits per second, or kilobits normally (kbps) It represents the audio quality, and the higher the better. 128kbps should be perfect for most stations.
- Bandwidth: This is the same as a standard web hosting server, but you must realize that the radio will be constantly streaming audio, so a huge bandwidth will be needed. 100Gb should be the least, and 300-500Gb will be perfect.
- Listeners: Online radios have a simultaneous listener limit, and that is a key factor when choosing a hosting plan. I strongly recommend a minimum of 50 listeners, although if you plan on growing 100-300 should be ok to start.
Disk space and other features such as PHP or MySQL are important as well, but nothing compared to the ones above. An ideal radio hosting plan should cost between 20-50$, have 128kbps, 300-500Gb, 500Gb bandwidth limit, and 400-600 listeners.
The one I prefer the most is GlowHost, which offers a great solution for SHOUTcast hosting, starting with a basic package at 25$/month, and 250 listeners.
Broadcasting
The nice thing about all this: Broadcasting from your home. And it is as simple as getting a program that works with broadcasting, there are 3 main ones:
- WinAmp: (Free) A standard audio player, that will let you play music to your radio as well as talk though a mic.
- Virtual DJ: (Commercial) Quite more sophisticated that WinAmp, it will allow you to do more complex mixes of the music, as well as talk via the mic.
- SAM Broadcaster: (Commercial) Probably the most professional solution for online radios, packed with features and goodies!
If you are going to broadcast from WinAmp you will first need to install the DSP plugin.
Also, take into account that when prompted for name and password, you can use any name you want, and then in the password field, you must write your username : and the password (i.e. username:password)
Important features
When setting up an online radio station it is vital to ensure that there will be something playing 24/7, so something like AutoDJ for SHOUTcast is perfect. AutoDJ will take all mp3 files in a folder you specify, and play them as a playlist any time there is no one broadcasting in your station, so it will fill up perfectly the spaces between shows and the times when there is no one available.
It is enabled in most radio hosts by default, and if it isn't you can normally request it free of charge. Setting it up is very easy, and it only requires having some mp3 files in a folder, that's why it is convenient to have enough disk space.
Playing the radio
SHOUTcast and most radio software include a set of scripts that will play the radio online, as well as showing information about current song, listener number... But they are not the best.
Ideally, you will include the radio in your website using a flash player, so that you can blend it nicely in the design. For this task I recommend the JW Player.
JW Player for online radio
In order to use it you must make sure SHOUTcast is set to stream in mp3. And then go to the link "Setup Flash Player". In the code they give you, you will find the radio address for the mp3 stream, it will look like this:
<div id="container"><a href="http://www.macromedia.com/go/getflashplayer">Get the Flash Player</a> to see this player.</div> <script type="text/javascript" src="swfobject.js"></script> <script type="text/javascript"> var s1 = new SWFObject("player.swf","ply","328","20","9","#FFFFFF"); s1.addParam("allowfullscreen","true"); s1.addParam("allowscriptaccess","always"); s1.addParam("flashvars","file=http://IP:PORT/;stream.nsv&type=mp3&volume=50&autostart=true"); s1.write("container"); </script>
Where it says IP and PORT, the corresponding IP and PORT of your radio will appear. You can directly embed this, or customize it. In order to do so, you can use the skings in the JW Player's website, or simply create yourself one, providing you have enough knowledge of Flash.
You're done!
If you still have any problems setting it up, or broadcasting, feel free to ask in the comments section below
Measuring the quality of your blog
0Although there are a lot of online resources to measure the wuality of your blog, like Blog Juice, Website grader (The bottom of the resports is dedicated to blog) and others like Technorati Authority, all of this services focus on popularity and sometimes SEO aspects.
Taking into account that all a blog is about is people reading your content, the best way to measure your site's quality is to use Google Analytics and check the time spent per visit, and the time spent per page. This way if you see 90% of the visits spend less that 10 seconds on your page, who cares if your traffic is 10k, your blog does not have quality. It may be the design, the content, something that is making the users leave even before they had time to know what it was about.
As you can see in the image, you have an average, a daily basis, and a visual graph displaying the variations. An ideal time would the one it takes the user to read the article and maybe post a comment. Most users will take about a minute to do all that. If your average is under a minute, then you must do something, go through the stats, page by page, and check how much time they spend on each page, that gives you an insight of what's happening.
Also check the main exiting pages (Which will usually be the same ones they entered) and sometimes you'll see a particular page being exited too much, it is usually because of the article title, an egaging title will keep your visitors on the site, at least long enough to determine whether your article is interesting or not.
Something that is usually a great way of keeping your users on the site is putting an image always on each article. It gives a visual representation of what the article is about, and as you know, an image is always understood faster than a whole paragraph.
Another great way to help the reader know what the article is about is highlighting important words. It helps the user know where to look when skimming through the article.
Follow this simple tips and you'll increase the time users spend on your blog, which hopefully will mean that they will start to read your articles from start to end!
Writing a good robots.txt file | SEO Tips
3Search Bots, crawl each URL and the first thing they search on an URL root is the robots.txt file. So if we make our robots.txt file, we can change the Search Bots' behaviours, and we can tell them where to search and publish and where to not. Imagine we have privacy folders in our website, for example if we have folder or a file containing e-mail addresses we don't want published, we can avoid Search Engine robots' visits using a few simple commands on the robots.txt file. Here we go:
Introduction
We use the /robots.txt file to give instructions about our site to web robots; this is called The Robots Exclusion Protocol.
Simply, the robots.txt is a very simple text file that is placed on our root directory. For example http://urbanoalvarez.es/robots.txt. This file tells search engines and other robots which areas of our site they are allowed to visit and index.
The only thing you must take into account is that ONLY one robots.txt file is allowed on our site and ONLY in the root directory (where our home page is)
TRUE: http://urbanoalvarez.es/robots.txt (Works)
FALSE: http://urbanoalvarez.es/images/robots.txt (Doesn't work)
All major search engine spiders respect this file, but unfortunately most spambots (email collectors, harvesters) do not. If you want security on your site or if you have files or contents to hide, you have to actually put the files in a protected directory, you can't trust the robots.txt file only.
Setting up our file
So what programs we need to create it? Just the good ol'notebook or any text editor program. All we need to do is to create a new text file, and rename it! Attention, the name has to be "robots.txt", cannot be "robot.txt" or "Robot.txt" or "robots.TXT".
Simple, no Caps and robots!
Writing the rules
Now that we are starting to write in it, a simple robots.txt looks like this.
User-agent: * Disallow:
The "User-agent: *" means this section applies to all robots, the wildcard "*" means all bots. The "Disallow: " tells the robots that they can go anywhere they want.
User-agent: * Disallow: /
A wildcard "*" is used in this one too, so all bots must read this. But in this one, there is a little difference, a slash "/" in the Disallow line, which means "don't allow anything to be crwaled", so the bots won't crawl you website, the good ones of course
If we want all the bots to read this text file, we should insert a "wildcard (*)" in the User-agent line. And when we leave the Disallow: line blank, it means come crawl my site you bots!, and when there is a slash it means keep out! Simple. This is the simplest way, now we can learn how to keep some bots crawling and some not.
Advanced rules
The User-agent line is the part we are going to work on to define the bot's identity and behavior. For example if we want the google bot to crawl the site but the yahoo bot not, how will our text file look?
User-agent: googlebot Disallow: User-agent: yahoo-slurp Disallow: /
In this sample, we called the googlebot and left the disallow line blank so we said crawl my website. And in the second line we called the yahoo bot but in the disallow line we have a slash so we wanted it to go away.
Now we are going to learn how to avoid some folders of our site getting searched by search engine spiders and how to get some folders to be searched at the same time. For this, we will change the values in the disallow line. For example we have two folders in our domain, /images, and /emails. We want /images to be searched but /emails not. Then the text file would look like:
User-agent: * Disallow: /emails/
As we can see, we called all the robots to read this, and we don't want the /emails folder to be seen, we excluded it but the rest of the website can be crawled by the robots.
Common samples
Here are few samples to make it clearer.
To exclude all folders from all the bots:
User-agent: * Disallow: /
To exclude any folder from all the bots:
User-agent: * Disallow: /emails/
To exclude all folders from a bot:
User-agent: googlebot Disallow: / User-agent: * Disallow:
To allow just one bot to crawl the site:
User-agent: googlebot Disallow: User-agent: * Disallow: /
To allow all the bots to see the all folders:
User-agent: * Disallow:
Important tips
After learning these, I believe you guys got it. Now there are a few rules that we should know. We can't use a wild card "*" in the Disallow line, bots don't read it then ( Google and MSNbot can). so a line like "Disallow: /emails/*.htm" is not a valid line for most bots. Another rule is, you have to make new user-agent and disallow lines for each specific bots, and you have to make a new disallow line for each directory that you want to exclude. "user-agent: googlebot, yahoobot" and "disallow: /emails, /images" are not valid.
Robots can ignore your /robots.txt. Especially malware or spam harvesting robots that scan the web for security vulnerabilities and email addresses will pay no attention.
The /robots.txt file is a publicly available file. Anyone can see what sections of your server you don't want robots to use. So don't try to use /robots.txt to hide information.
Is it possible to allow just one file or folder or directory to be crawled and the rest not? Simply there is no allow line in robots.txt, but mentally yea that can be done. How? You can insert all the files that you don't want to be seen in a folder and disallow it.
For example, "Disallow: /files_that_I_dont_want_to_share/ "
Major robots
Major Known Spiders
Googlebot (Google), Googlebot-Image (Google Image Search), MSNBot (MSN), Slurp (Yahoo), Yahoo-Blogs, Mozilla/2.0 (compatible; Ask Jeeves/Teoma), Gigabot (Gigablast), Scrubby (Scrub The Web), Robozilla (DMOZ)
Google allows the use of asterisks. Disallow patterns may include "*" to match any sequence of characters, and patterns may end in "$" to indicate the end of a name. To remove all files of a specific file type (for example, to include .jpg but not .gif images), you'd use the following robots.txt entry:
User-agent: Googlebot-Image Disallow: /*.gif$
Yahoo
Yahoo also has a few specific commands, including the:
Crawl-delay: xx instruction, where "xx" is the minimum delay in seconds between successive crawler accesses. Yahoo's default crawl-delay value is 1 second. If the crawler rate is a problem for your server, you can set the delay up to up to 5 or 20 or a comfortable value for your server.
Setting a crawl-delay of 20 seconds for Yahoo-Blogs/v3.9 would look something like:
User-agent: Yahoo-Blogs/v3.9 Crawl-delay: 20
Ask / Teoma
Supports the crawl-delay command.
MSN Search
Supports the crawl-delay command. Also allows wildcard behavior
User-agent: msnbot Disallow: /*.[file extension]$
(the "$" is required, in order to declare the end of the file)
Examples:
User-agent: msnbot Disallow: /*.PDF$ Disallow: /*.jpeg$ Disallow: /*.exe$
Why do I want a Robots.txt?
There are several reasons you would want to control a robots visit to your site:
- It saves your bandwidth - the spider won't visit areas where there is no useful information (your cgi-bin, images, etc)
- It gives you a very basic level of protection - although it's not very good security, it will keep people from easily finding stuff you don't want easily accessible via search engines. They actually have to visit your site and go to the directory instead of finding it on Google, MSN, Yahoo or Teoma.
- It cleans up your logs - every time a search engine visits your site it requests the robots.txt, which can happen several times a day. If you don't have one it generates a "404 Not Found" error each time. It's hard to wade through all of these to find genuine errors at the end of the month.
- It can prevent spam and penalties associated with duplicate content. Lets say you have a high speed and low speed version of your site, or a landing page intended for use with advertising campaigns. If this content duplicates other content on your site you can find yourself in ill-favor with some search engines. You can use the robots.txt file to prevent the content from being indexed, and therefore avoid issues. Some webmasters also use it to exclude "test" or "development" areas of a website that are not ready for public viewing yet.
- It's good programming policy. Pros have a robots.txt. Amateurs don't. What group do you want your site to be in? This is more of an ego/image thing than a "real" reason but in competitive areas or when applying for a job can make a difference. Some employers may consider not hiring a webmaster who didn't know how to use one, on the assumption that they may not to know other, more critical things, as well. Many feel it's sloppy and unprofessional not to use one.
So, as a web site owner you need to put it in the right place on your web server for that resulting URL to work. Usually that is the same place where you put your web site's main "index.html" welcome page. Where exactly that is, and how to put the file there, depends on your web server software.
Remember to use all lower case for the filename: "robots.txt", not "Robots.TXT.
Major Search Engine Bots - Spiders Names
Google = googlebot
MSN Search = msnbot
Yahoo = yahoo-slurp
Ask/Teoma = teoma
GigaBlast = gigabot
Scrub The Web = scrubby
DMOZ Checker = robozilla
Nutch = nutch
Alexa/Wayback = ia_archiver
Baidu = baiduspider
Specific Special Bots:
Google Image = googlebot-image
Yahoo MM = yahoo-mmcrawler
MSN PicSearch = psbot
SingingFish = asterias
Yahoo Blogs = yahoo-blogs/v3.9
Main source of information can be found in this forum post by Paskall.
Feel free to ask any question, or correct my mistakes,
Cheers
Best site evaluating tools
0I guess I'm enjoying writing "Best..." compilations, and well, it is actually a great way to have all your favorite websites all in one place!
In this list I'll list the best site evaluation tools, either SEO analysis, design accessibility issues...
If you know of any great tool that I missed out, please make sure to comment, it's free!
webite value calculator, blog juice calculator, xinu, website grader, crawl tester, w3c
Blog Juice Calculator:
Probably one of the best blog rating online services out there. Learn how much "juice" your site has using this great tool. It checks almost everything that could be possible checked in a blog, and it advices you on how to improve it.
Website grader:
Another great tool, similar to blog juice but for non-blogs too. It provides you with a grado from 0-100, depending on "over 50 variables". Always good to know if you passed right?
Crawl test
Probably one of the best SEO tools here, it crawls your site and it gives you complete feedback on every single page crawled. You need to register though, and the free version limits the number of crawled pages to 5, but still worth using!
Xinu - Website Analysis
Another SEO evaluation tool, it displays various information about your site collected from all over the web. Very useful tool to know where you must strengthen your efforts!
Website value calculator:
This is more of an entertainment tool than an actual SEO tool, but still, it is cool to check... Right now it says I could sell this blog for 86$...
The grand W3C validation services:
Probably the best tool of all when trying to improve accessibility is this one. There are two validators, the html validator and the css validator. (The image points to the html one)
If you know any more great tools that I missed out, please comment!
Blogging ping list
3Have you ever felt like your blog is not reaching the targeted audience? Maybe it is because the major blogging sites don't even know who you are!
Obviously the first step is to claim your blog in all majors, like Technorati for example, but pinging to only two or three doesn't really do the trick right?
Well here is a solution, add to your ping list a very wide range of blogging sites that will make sure that your blog at least exists .
Ping URL's list:
http://1470.net/api/ping http://www.a2b.cc/setloc/bp.a2b http://api.feedster.com/ping http://api.moreover.com/RPC2 http://api.moreover.com/ping http://api.my.yahoo.com/RPC2 http://api.my.yahoo.com/rss/ping http://www.bitacoles.net/ping.php http://bitacoras.net/ping http://blogdb.jp/xmlrpc http://www.blogdigger.com/RPC2 http://blogmatcher.com/u.php http://www.blogoole.com/ping/ http://www.blogoon.net/ping/ http://www.blogpeople.net/servlet/weblogUpdates http://www.blogroots.com/tb_populi.blog?id=1 http://www.blogshares.com/rpc.php http://www.blogsnow.com/ping http://www.blogstreet.com/xrbin/xmlrpc.cgi http://blog.goo.ne.jp/XMLRPC http://bulkfeeds.net/rpc http://coreblog.org/ping/ http://www.lasermemory.com/lsrpc/ http://mod-pubsub.org/kn_apps/blogchatt http://www.mod-pubsub.org/kn_apps/blogchatter/ping.php http://www.newsisfree.com/xmlrpctest.php http://ping.amagle.com/ http://ping.bitacoras.com http://ping.blo.gs/ http://ping.bloggers.jp/rpc/ http://ping.blogmura.jp/rpc/ http://ping.cocolog-nifty.com/xmlrpc http://ping.exblog.jp/xmlrpc http://ping.feedburner.com http://ping.myblog.jp http://ping.rootblog.com/rpc.php http://ping.syndic8.com/xmlrpc.php http://ping.weblogalot.com/rpc.php http://ping.weblogs.se/ http://pingoat.com/goat/RPC2 http://www.popdex.com/addsite.php http://rcs.datashed.net/RPC2/ http://rpc.blogbuzzmachine.com/RPC2 http://rpc.blogrolling.com/pinger/ http://rpc.icerocket.com:10080/ http://rpc.pingomatic.com/ http://rpc.technorati.com/rpc/ping http://rpc.weblogs.com/RPC2 http://www.snipsnap.org/RPC2 http://trackback.bakeinu.jp/bakeping.php http://topicexchange.com/RPC2 http://www.weblogues.com/RPC/ http://xping.pubsub.com/ping/ http://xmlrpc.blogg.de/
Another tool you might want to give a try is Ping-o-Matic, which (in case you never heard of it) pings some major blogging sites too.
If after all this you still don't get the visitor's you think you should, then you must probably reconsider the topic you blog about!
If this helped you, or you enjoyed this information, please bookmark!
Also if you know of some other ping sites, or one of this links is broken, please comment





