Archive for the 'Search Engine Optimisation' Category

Apr 18 2012

Common SEO Issues: Canonicalization

Published by Oakley under SEO, Search Engine Optimisation

Canonicalization has always been an issue for SEOs. Years ago Google was notoriously bad at working out some pretty basic issues, but these days it’s not too bad with the simple issues, and provides easy tools to help sort them. Gone at the days when site.com, www.site.com, www.site.com/index.php etc were the biggest canonicalization problems. They are rarely problems these days and easily fixed.

However, there are many other problems which are common (and some less so) but often over looked. It’s always been an issue, and with duplicate content being a big issue in Panda, it’s now doubly time to sort out those issues.

Just to be clear, a canonicalization problem is when there are multiple versions of the same page, or close variations of it, on the same domain. That’s a type of duplicate content, although duplicate content also covers across domains. Cross domain duplicate content isn’t a canonicalization problem (usually) so I’ll be excluding that. The word “canonical” means definitive, the original, the main version. So when there’s two pieces of identical or similar content, the canonical version is the one you want Google to know about. The other needs to be either removed or an instruction given that it is not the canonical version.

Here’s a few problems that I run into now and then, with some possible solutions. This is by no means a definitive list of problems and solutions, but it’s a start.

Layered Navigation

Layered navigation (sometimes called filtered, faceted or guided navigation) is a wonderful thing in ecommerce sites which I personally love using as a user. It’s gotten a more popular over the last few years, in part due to the rising popularity of Magento which has layered navigation out of the box. Most enterprise search software packages also include some sort of layered navigation option. However, when you add a facet to your search, you often add a new URL. There is effectively an infinite number of pages, many containing the same or similar content.

Possible Solutions

This can be both an opportunity and a problem. By carefully managing canonicalization, you can create a large number of useful pages while eliminating useless pages. For example, if you are selling bikes and have 3 facets: price range, type (mountain, racing), and age (child, teen, adult). Pages on “mountain bikes for teens between $500 and $750” and “mountain bikes for teens between $750 and $1200” are not likely to be useful. Those pages should be eliminated. Generally the best way is to use the canonical tag. However, a page on “mountain bikes for teens” may well be useful and may attract some high quality long tail searches. So this variation should be allowed to live. The idea is to pick which facets add value from a search perspective and which don’t. Price rarely adds value. The other facets are very dependent on your niche. If you are unsure, safer to leave it out, or consult an expert.

This can be non-trivial to implement technically in some cases, but it can be worthwhile. I’m not aware of a Magento extension which handles this well for Magento. If one exists, please let me know!

Ecommerce Product Pages

This applies to other general CMS scenarios as well at times, but the problems are most common in ecommerce shopping cart software. This is a bit fuzzy as it varies a lot depending on your ecommerce platform and configuration. Some problems I’ve seen:

Pages in Multiple Categories

Often it makes sense to have the same page in multiple categories. For example, on a bike shop, you might have a mountain bike category and a teen category. So you could end up in a scenario where you have the same content at:
www.site.com/teen-bikes/red-mountain-bike.html
and
www.site.com/mountain-bikes/red-mountain-bike.html

Possible Solution

Some ecommerce platforms solve this by having a default category so the problem above doesn’t exist. That is a mixed blessing and can cause other usability and IA issues beyond the scope of this discussion. The solution when there are two URLs is to nominate a default category and use the canonical tag. Your software needs to have support for this, and it can be tough to explain to editors.

Multiple Versions of the Product Pages

This is pretty fuzzy, but can and does happen. For example, I saw a site recently where there were standard product pages, but then also a near-identical version (without the menu, etc) as a popup that would appear from a link in the checkout and a few other places.

Possible Solution

Don’t do it. If you must for whatever reason, use a canonical tag on the rogue page back to the main page, or a simpler, not quite as good but good enough solution is to add a meta noindex tag to the rogue page. Chances are rogue pages will attract no inbound links so noindex won’t be of any harm.

Different URLs for the Same Page

This is typically sloppy programming. Some examples are:
www.site.com/category/product.html
also appears as
www.site.com/product.php?id=1234
Another one I see a lot is
www.site.com/products/10384/product-name.html
however, 10384 is the actual key, and product-name is just there for vanity reasons and is ignored by the software. You could type in
www.site.com/products/10384/i-love-chickens.html
and the exact same content would appear.

Possible Solutions

For the first one, if the system is well programmed then the product.php will never appear anywhere and so the spiders will be oblivious to it so there’s no problem. However, this isn’t always the case. The solution is to make it the case. There are other work arounds but they are all inferior.

For the second scenario, while that is quite common, in my experience software that does that invariably does it well and variations of the vanity string are never seen, so I wouldn’t worry about it. It should be fine. There are some edge cases where problems can occur, but I’m trying not to turn this into a 100,000 word book (although I suspect you could write 100,000 words on canonicalization). One of the edge cases is where smart alecs like me change the string to something funny so an observant webmaster will see some bizarre page requests appearing in their Google Analytics report! I mentioned my habit of doing this from time to time to Danny at First Rate and it turns out he does exactly the same thing!

Tracking Codes

A quite common problem is tracking codes of various types. Some of these include:

www.site.com/affid=1234 if you have an affiliate program

www.site.com/?utm_source=somesource&utm_medium=somemedium&utm_campaign=somecampaign if you are using Google Analytics campaign tracking (which you probably should be). Adobe Site Catalyst and other packages have similar codes.

There are other scenarios where various redundant query strings like that are appended.

Possible Solutions

There’s a few ways to solve this. By far the easiest is to go into Google Webmaster Central and tell Google to ignore that query string. 5 minutes, done. There are other solutions, you can use 301s, canonical tags, replace ‘?’ with ‘#’ (which can be done with Google Analytics with a minor tweak), and probably others I can’t think of, but they all feel like more work and more error prone to me.

Case Issues

The web is fundamentally case sensitive. www.site.com/page.html and www.site.com/Page.html are different URLs. Unfortunately for those running on Microsoft’s IIS Server, it sees them as the same. This can encourage sloppy case insensitive programming which on a *nix server would show as an error. I’ve seen sites with internal links pointing to www.site.com/page.aspx and www.site.com/Page.aspx. These pages could then be crawled by Google and potentially appear as duplicate content.

Slightly off topic, this can cause problems with robots.txt. I once worked with a client who had a page which it was quite important to not be indexed. I added www.site.com/secretpage.html to robots.txt. The site was running on IIS. Unfortunately, there was a number of both internal and external links pointing to www.site.com/Secretpage.html, www.site.com/SecretPage.html, www.site.com/SECRETPAGE.html and so on. Given robots.txt is case sensitive I ended up adding more and more case variants to the robots.txt and eventually it was de-indexed. This is a nasty scenario, and I’m not aware of a work around – please let me know if you are (apart from “don’t use IIS”!).

Possible Solution

To the best of my knowledge, there’s no easy way to make IIS case sensitive. There might be some ISAPI solution, and it can be done at an application level, but in practice both those solutions are generally not feasible. The solution here is simple good coding practice and hope for the best. There are probably some advanced things you could do, but in practice they would be more work than good coding practice. As for the robots.txt problem, it’s got me stumped.

Blogs

Out of the box, Wordpress is notorious for having canonical issues. A new blog post can appear at:

  • www.blog.com
  • www.blog.com/2011/post-name.html (or similar depending on configuration)
  • www.blog.com/category/categoryname
  • www.blog.com/2011/12 (the archives)

and a few more. So which is the canonical version?

Possible Solutions

In the case of Wordpress it’s pretty easy – just install the Yoast SEO plugin for Wordpress. For other blog software which will typically have similar problems, that’s a bit trickier. You are going to have to do some homework or call in some experts.

Content Distribution Networks

A CDN (content distribution network) often uses cnames to map many subdomains to the relevant local data centre. Even a single sub-domain, such as used by the Amazon AWS CDN, is still potentially duplicate content. So, the same content could be coming from
www.site.com/page.html
and
cdn.site.com/page.html

Possible Solutions

  1. Use the cross domain canonical tag
  2. Configure your CDN to use the rel=”http” canonical HTTP headers

By Mark Baartse, Consulting Director

Related Posts:

No responses yet

Oct 07 2011

The SEO eCommerce Checklist

Our consulting team is often asked about what areas should be focused on when thinking about search for purchase driven or etail sites, as in today’s retail market companies really need to gain that competitive advantage. So, we’ve compiled this checklist as a foundation for what we like to call a ‘search friendly’ site.

The ultimate goal of search engine optimisation is to get Google to nod it’s head to your keywords, you can control this:

1) On-page Optimisation: keyword optimisations you make in your web page code

  • Distinguish between “content” and “administrative” pages – then optimise only content pages.
  • Choose just one keyword phrase per content page
  • Use the keyword in your page’s file name; delimit with dashes
  • Use keyword in the page title, “meta description” (twice if possible)
  • Use keyword in a headline (use H1 or H2). Usually this is your page title.
  • Use keyword several times in the body text where reasonable.
  • Make sure your content is useful. Ok, search engines aren’t that smart, but visitors are.

2) Off Page Optimisation: The links coming into your site

  • Submit your site to all the standard directories
  • Submit article-style content to for syndication, making sure your content contains many keyword links back to your other content pages. This will get you hundreds of quality keyword links.
  • Periodic press releases.
  • Create a Google site map and submit it to Google.
  • Create RSS feeds of your content and submit to various feed lists. This method will get you several hundred minor links per content page.
  • Add a “link to this page” box on every content page to encourage deep-linking with the right keywords
  • Locate quality sites linking to your competitor – contact the owner and suggest that they mention your site as an alternative option for their visitors. It’s amazing how many quality links you can get just by asking politely.

3) Site Optimisation: The navigation and linking structure you use across your site

  • Don’t use both www.yourdomain.com and yourdomain.com. Pick one or the other and stick to it
  • Don’t link to your home page as index.html. Google only knows pages by link target
  • Each content page should be linked to with one consistent keyword-friendly URL, and never with parameters

bad: http://firstrate.com.au/forum.php?topic=873736
good: http://firstrate.com.au /my-keyword-tp873736.html

  • Link to each content page with static text links (not JavaScript, not Flash, or Images) and the anchor text of links should contain the target page’s keyword phrase.

bad link: “click here for split testing tools” - this just confuses the search engine
good link: “click here for split testing tools”

  • Don’t link back to your home page with the link text “home”…
  • Interlink between content pages from within paragraph text. These links seem to “count” more and will get you a bigger ‘nod of the head’ from Google.

Site Optimisation is the most powerful SEO area over which you have total control! It’s all about how you link your pages together. If done properly, it can have a significant impact.

These are only some suggestions to think about. If you have any questions about best practice SEO for etail, write us a comment below or send an email to experts@firstrate.com.au

Related Posts:

No responses yet

Aug 04 2011

IAB Award Winning Case Study: Focus Property

Download our award winning case study

The Power of Longtail: Focus Property

First Rate increases leading brand’s search engine rankings, powering their online business so that they rank above their competitors.

We have 30 consultants across Sydney, Perth and Auckland managing over 100 Google Adword Campaigns and generating over 1 million clicks a month and are proudly Google Analytics and Google Adwords certified.

Related Posts:

No responses yet

Jul 29 2011

First Rate Wins IAB Award for Organic Search (SEO)

First Rate has taken the top prize for the IAB Australia Awards 2011, in the Organic Search (SEO) category for our client Focus Property.

The awards were held last night in Sydney and we’re all stoked to have claimed the top prize amongst a high calibre of entrants. Judges commented that our entry was

“a very well laid out entry with a nice focus on the right aspects of SEO. The point of differentiation is that they ticked criteria for SEO but went above and beyond focusing on Longtail strategy.”

Award Winning Case Study: The Power of Longtail, Focus Property

The state of play

Focus Property is Sydney’s leading property management firm. Currently managing over a 1/4 billion dollars worth of residential real estate with in Sydney’s CBD. Focus Property enlists services such as sales, letting and property management.

Our client operates in a highly competitive market with many large and established players, such as Ray White, LJ Hooker, Run Property and the Carrington Group. The competitors had strong presence in not only the property management segment but also the sales and letting segments.  Focus Property realised that an increasing amount of new business was being generated through the web as buyer behavior evolved, especially through the search engines.

The Strategy

First Rate was engaged to help improve search engine visibility and drive qualified organic leads and increase footprint. While there was a significant traffic available for head terms such as “property management”, these would not be best for converting into high quality leads. This is because they were generic and at the research phase of the buying cycle rather than being brand focused in their search queries.

First Rate discovered Focus Property had service offerings in 118 suburbs that had sufficient volume to blanket the long tail search market.  Long tail local keywords were focused on because collectively the volume of these keywords was large and anticipated that conversion rate and lead quality would be greater. This was consistent with focus on generating high quality traffic. Personalized landing pages were created for the target suburbs, giving more personalized pages and relevant content. Suburb pages were created and optimized for terms such as “<suburb> property management” and variants of that. This would then result in high quality localized leads.

Tactics

Content would be needed to target the identified keywords for example, “Bondi Property Management” and no 1 for virtually all terms relating to suburb name, property management as above. Working with Focus Property, we created pages for each of the 118 target suburbs, containing information and photos specific to each suburb Information was sourced from a number of locations such as the Australian Bureau of Statistics. This created rich content for the search engines, as well as creating a relevant, personalized landing page experience for prospects as originally forecast.

Results

We saw not only a significant increase in unique visits and visits from organic search but also an increase in ranking for head and secondary terms securing the head term as ranking number 1.

  • In 12 months traffic jumped from 481 visits per month in Dec 09 to over 3300 unique visits per month in Dec 10.
  • Monthly visits from non-paid, non-brand organic search increased 15 fold (from roughly 110 visits in Dec ’09 to over 1,700 visits in Dec ’10).
  • #1 ranking in Google for head term “property management” achieved in early 2011.
  • Top 5 rankings for a variety of secondary terms.



Get in touch with our team for more information about the power of longtail or SEO.

experts@firstrate.com.au

Related Posts:

One response so far

Next »

 



First Rate on LinkedIn