Forum Moderators: phranque

Message Too Old, No Replies

Internal Linking

Issue in finding internal link

         

austinmartin

1:15 pm on Nov 23, 2011 (gmt 0)

10+ Year Member



I am working on a website, some pages of that website hasn't linking from home page(index page) but these are exist and index in Google. But I am unable to find out from where Google get index these find. Even I also having problem to find out link to that pages.
E.g. : www.redemptiontoys.com/arcade_small_toys_prizes.html page is exist and index by google but not linking from home page or even another pages.

I always respect to your response.

Thank you.. ! :)

piatkow

1:31 pm on Nov 23, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Once Google have the page indexed it will stay indexed as long as it exists. If a link that no longer exists was spidered in the past or if it was in a site map then they will have found it and will continue to index it.

I have seen plenty of examples of orphaned pages continuing in the SERPS and plenty of webmasters who thought they were "taking down" a page by making it an orphan.

lucy24

8:17 pm on Nov 23, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Does anyone actually visit the pages? If so, logs will tell you how they got there. Unless, of course, the only way they ever get there is via search engines.

austinmartin

4:42 am on Nov 24, 2011 (gmt 0)

10+ Year Member



@piatkow and @lucy24 thanks for your contributions.. :)

Jonesy

7:50 pm on Nov 26, 2011 (gmt 0)

10+ Year Member Top Contributors Of The Month



I've often wondered about this, too.
I have a not-NEVER-linked-to-by-ANYTHING web page mainly for personal use --
local weather conditions, weather radar image, and a few local web cams.
But, I have passed the URL on to several friends (who have NO idea how to put
up a web page or a blog). Yet Google has the page indexed -- which
really is not a concern.
I'm guessing it got into the wild via
either:
1. A Google-like toolbar.
2. A Facebook or Twitter posting.

lucy24

12:46 am on Nov 27, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



3. gmail

MichaelBluejay

7:42 am on Nov 30, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



There are so many ways that Google finds pages on its own, which is funny, because newbies are always asking, "How can I get my pages into Google?", when the reality is that it's harder to keep your pages *out* of Google!

phranque

12:17 pm on Nov 30, 2011 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



you might find this thread interesting...

Top 20 Stealth Links - Getting your url in front of Search Engines by nontraditional means:
http://www.webmasterworld.com/google/3893713.htm [webmasterworld.com]

enigma1

12:50 pm on Nov 30, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



plenty of webmasters who thought they were "taking down" a page by making it an orphan.

In fact you are taking down a url if it's an orphan. It may take a while but eventually spiders will drop it unless you have references somewhere in your domain the spider can trace and imply a problem with the code you use.

piatkow

2:46 pm on Nov 30, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month




In fact you are taking down a url if it's an orphan.

I have seen pages stay in the SERPS despite all links being removed. They were only found with some very obscure long tail searches but they were still indexed.

Back to the OP, removing internal links does not guarantee that all links are removed. Somebody may have deep linked to the site.

enigma1

4:13 pm on Nov 30, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I have seen pages stay in the SERPS despite all links being removed

Yes spiders may keep accessing them no matter what you do. If the links are listed on other domains bots will follow. So for example if you buy a domain and post completely different content the spiders may keep accessing the old links for an eternity no matter what headers you output 200, 301, 404 doesn't matter. They will follow any url if they find it internal or external.

Now if you see errors in GWT it maybe different and signify a problem with the application somehow generating invalid urls. You need to watch for those and in many cases google doesn't even state the path to trace them.