blog :: seo :: semantic search engine optimization

Semantic Search Optimization

Posted on 03 Mar 2009 by - Permanent link Trackback this post Subscribe to this post Comment on this post -  

Search Engine Optimisation has seen exponential growth over the last year and the good times look set to continue as firms make the most of their web assets during the credit crunch. Google’s webmaster group published their guidelines covering the basics of promoting HTML content - the rest is down to effort, discipline, time and creativity - but search technology is seeing some big changes on the horizon.

The emergence of the semantic web is going shake things up big time and require some old SEOs to learn new tricks. Whilst text indexing will remain the bedrock of internet search technology, e-commerce sites that leverage semantic search technologies will find it much easier to get closer to thier customers. SEOs working for such forward-thinking sites are going to have to brush up on their technical skills and go back to the roots of Information Retrieval.

Why?

The semantic web is about presenting data to search engines. Let’s call these snippets of data facts (their technical name is triples, but facts is a great term to help your understanding).

I can write a web page using current HTML technology and the only facts that I give to the search engine crawlers are:

  • “There is a collection of words at this URL”
  • “The words in my heading are more important than the words in the body”
  • “The document is written in British English”

I could add microformats to the page and include these facts:

  • “The document was written by Andy”
  • “The document was published on 18th Feb 2009”

Great, I could ask a search engine for all documents written by Andy between July and August 2008, but few people ever will. They ask for short jumbles of words (“cheap holiday Heathrow spain”) because that works well with text indexing technologies available now. That query really means “I want to see details of cheap holidays to Spain that fly from Heathrow”.

Now consider optimising a page to include these facts:

  • “This document describes a holiday”
  • “The holiday is in Alfarnate, Costa del Sol, Spain”
  • “The holiday includes flights from Heathrow Airport, 234 Bath Road, Hayes, Middlesex, UB3 5AP, United Kingdom”
  • “The flights are with the Jet2 airline”

Are you SEOs reading this getting excited/worried yet?

How?

As an SSO, it’s your job to get your document considered as an answer to as many questions as possible. To achieve this, there are parallels with current, textual search engines:

  • Get your data discovered and crawled by the semantic search engines
  • Build trust in your data
  • Ensure your data is valid and up-to-date

The big deal here is trust.

Semantic web tools start out with a corpus that they trust, often Wikipedia, scientific or government data. They don’t want to pollute that with dodgy data that’s been hacked around to get better rankings so you’re going to need your whitest hat. SEOs are familiar with Google’s TrustRank algorithm, and you can bet that this will be applied to RDF data too.

Links

As always, links will be hugely important.

A shoe e-tailer could link from his product offering to the manufacturer’s product description data. Affiliates and price comparison sites can link to the e-tailer’s product data. This already happens with HTML pages but it will happen with data objects too.

Guidelines for semantic link building will follow similar themes to those for HTML link building:

  • Create compelling content = create useful data
  • Build relationships, don’t buy them
  • Become an authority in your niche
  • Link out to other datasets
  • Follow standards = use an existing ontology
  • Blog and ping = update and ping

Directories Make A Come-Back?

Although discredited for HTML link building, I think that directories will regain favour as people look for good data - at least in the short to medium term. Directories of semantic data sources will be able to capitalise on the machine-readability of RDF and implement a form of automated quality scoring.

When?

You can start now - even the humblest WordPress blog can add hAtom semantic tags by following these instructions.

More complex sites, such as e-tailers should begin planning to deliver semantic data sooner rather than later - perhaps by integrating it with your affiliate programme. Being the first in your niche will count for a lot when other sites have invest effort in linking to your data.

I’d like to bring your attention to an excellent slide show by Jonathan Mendez that discusses the use of semantic data in landing pages. Lots to think about there.

Are you an SEO? Have your thought how “Web 3.0” will change your working practices? Have any customers asked about semantic technologies? Please leave a comment.


Creative Commons licensed photo by lecasio.

0 Trackbacks

Trackbacks are closed for this story.

6 Comments

 Craig Parker said at 2009-03-04 11:23

Great Post

First of all nice post, short and to the point but informative none the less, secondly thanks for the link to the Wordpress tutorial, that's pretty good too.

As an SEM I am recently exploring semantic web a lot more (along with a bunch of the IR stuff) and this post is just a fantastic summary of why people in the industry should be paying attention.

I think it's obvious this will and to a certain extent, already is affecting SEO/SEM and i'm actually really looking forward to how it develops and how the web community reacts. Currently we havn't had any clients specifically ask us about semantic web but I think as the Web 3.0 buzzword gets passed around more it will definately be coming up in meetings.

As far as the way we work we are already changing the things we do, trying to stay on top of the semantic web standards and making the newer projects we work on compatible at build, not always easy in these early stages though!

 MMMeeja said at 2009-03-04 11:36

Re: Great Post

Thanks for your kind words, Craig.

I think we're close to a semantic web tipping point so SEOs need to be exploring and making plans for it right now. Interesting times...

 Raphael said at 2009-03-04 11:53

Let's be critical

Everywhere you get in the web looking to the future you here phrases put together with "semantic" and never I really understood what this should meen more then search engines do now: collecting facts by criteria that are far away from being semantic.

Of course user are searching in a semantic way, how they could do different? But google and other engines aren't in any way able to answer this questions in a semantic way. And they will not.

What google does is collecting pages and tag them, giving them trustpoints and other criteria. Now somebody is searching something. Google is tagging this question and asks the database which page ist tagged very similar and has enough trust to show this the questioner.

And there trust was always important. Pagerank was something to simulate this. Did not work good enough. But was better then metakeywords and -descriptions. Now trust is a mixture from a lot of criteria, but not in any way semantic.

Google is getting out spam, not collecting good pages. Therefor is spam. So Google does not show you the best pages in serps but the most not bad pages which are the most similar to the tag-translation of your question.

This makes your post not wrong in any way, but to talk about web 3.0 because of some more criteria in search engines is in my point of view a bit to excited.

For SEO will not change more then in the last yaers. You have to see and find out the criterias of Google.  Not more, not less. It's enough work. If there are new ones, better be fast.

 

 MMMeeja said at 2009-03-04 12:16

Re: Let's be critical

I agree that Google's text matching algorithms are not semantic and open to abuse by spammers and blackhats but I'm looking to the future when search engines can find meaning behind in text and other data.

A good example is this Powerset search for "How old is Barack Obama?". Sure, it's using trusted Wikipedia data but hopefully soon we'll be able to ask "Where can I buy size 60 Adidas shoes in Berlin?" and get an answer of similar accuracy, not a list of pages containing relevant text.

Much of the reaction to this post on Twitter has been worry about how SEOs will abuse the next generation web (Hi indymike) and that's a valid concern. I'm sure it's being addressed by the semantic search developers and the blackhats too.

 Raphael said at 2009-03-04 12:49

Meaning

It's just, that I don't think, that it will be possible, that maschines understand meaning, because it is nothing you can translate completly int 1 and 0. Menaing is a net of appreciation and this is something what have to be understood to. I don't want to say, that it is completly impossible, therefor we know to less about how we understand, but exactly there is the border. Because from now we can teach maschines just what we really understand.

Of couse will search engine sometimes be able to understand some questions like you pointed out. But not because they are understanding mor meaning, but beccause webmasters and SEOs understand better the way of searching by people and the translation by google.

The way to understanding maschines is still a long way and I gess we will not been udnerstood by one in our lives anymore, but this is to prophetic;)

Now I just see a lot of things changing in consciousness of people building webpages and people searching in the net. And through this will change a lot, there I am with you for sure. But I don't see the this change in new technic but in this what people how do.

And through this maybe we even could say that the web gets more semantic in a non-direct way.

PS: Excuse my language but some years in school are just not enough. Hope you understand what I try to tell.

 MMMeeja said at 2009-03-04 13:40

Re: Meaning

Oh, I totally agree that the semantic web will not be like the thinking, talking computers we see in the movies (I kind of hope that we'll never see that). We webmasters and SEOs will be working with databases not just web pages and we'll need to adapt to that.

Your English is much. much better than my German, despite working in Frankfurt a few times. Thank you for your comments.

Comments are closed for this story.

 

Sitemap

Copyright © 2006-2009 MMMeeja Pty. Ltd. All rights reserved.