I've moved...

This blog now has a new home - please update your shortcuts and readers to: www.jeffkemponoracle.com. Sorry for any inconvenience!

Monday, October 26, 2009

Google indexes duplicate pages from my Apex site - problem solved

Problem: when Google indexes my Apex web site, it considers the following URLs to be different pages:
http://www.site.com/apex/f?p=100:1:1234567890::::
http://www.site.com/apex/f?p=100:1:0::::
http://www.site.com/apex/f?p=MYAPP:1:46346346346::::
http://www.site.com/apex/f?p=MYAPP:1:34634634636::::
http://www.site.com/apex/f?p=MYAPP:HOME:46346346346::::
http://www.site.com/apex/f?p=MYAPP:HOME:0::::

Notice how my application with ID 100 has an alias of MYAPP, and page 1 has an alias of HOME; also, more duplicates happen due to the session ID; all these URLs point to pretty much the same content, but Google indexes them all as separate pages.

[Full Article]

4 comments:

  1. Jeff,

    Thanks for the info! Do you think it will work with Google Analytics on APEX applications?

    Martin

    ReplyDelete
  2. Nice.

    have you had a sneak peak at what AskTom might be doing? I've previously noticed some entries come up on google, but never really thought about it.

    Have you tried to say canonical 10 times fast?

    ReplyDelete
  3. > D'Souza: Google Analytics

    Sorry, I don't know.

    > Wesley: AskTom

    I've had a quick look, but it doesn't appear he has - if you google "site:asktom.oracle.com" and went through all the results carefully I dare say you'd find a few duplicate links there because of this, partly because in many cases Google has been assigned various session IDs by the apex site when it crawls it (this is mitigated somewhat in some cases where the session ID is zero, but not always).

    "canonical 10 times fast"... easy!

    ReplyDelete
  4. See this?
    http://joelkallman.blogspot.com/2009/11/who-uses-asktomoraclecom.html

    ReplyDelete

Note: only a member of this blog may post a comment.