My site has a few pages served over HTTPS and a few over HTTP.
The problem is that the footer of my HTTPS pages links to certain pages, such as the home page, the sitemap page and the services page. Google has now indexed my domain as [mydomain.com...], and Yahoo and MSN have indexed certain HTML pages under HTTPS that are not linked from the footer of my HTTPS pages.
For example
Linked pages in footer section of HTTPS version pages
[mydomain.com...]
[mydomain.com...]
[mydomain.com...]
Pages which are not linked in footer section of HTTPS version pages but still got indexed in yahoo and msn
[mydomain.com...]
[mydomain.com...]
[mydomain.com...]
If you click on the above pages, they all redirect to their respective HTTP versions with a 302.
Now the big question: how have the search engines (Yahoo and MSN) indexed static HTML pages under HTTPS when none of my pages link to them?
How can I remove those [mydomain.com...] pages using robots.txt or .htaccess?
Questions
How can I prevent the HTTPS versions of my pages from being indexed, excluding my landing page?
What should I do to stop my main domain from being crawled over HTTPS (i.e. [mydomain.com...])?
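Just exclude the https pages in robots.txt. Keep in mind that robots.txt is fetched separately for each protocol and host, so the rules for the https pages have to be in the file served at the https URL, not the http one.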
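As a rough sketch of the robots.txt approach, assuming an Apache 2.x server with mod_rewrite enabled and both protocols served from the same document root: the rule below hands crawlers a different robots file when the request comes in over HTTPS. The file name robots_ssl.txt and the path /landing-page.html are made-up placeholders, not names from the original post.

```apache
# .htaccess in the document root (assumes mod_rewrite is available)
RewriteEngine On

# Only for requests that arrive over HTTPS...
RewriteCond %{HTTPS} on
# ...answer /robots.txt with the contents of robots_ssl.txt (hypothetical file name).
RewriteRule ^robots\.txt$ robots_ssl.txt [L]

# robots_ssl.txt could then contain something like:
#
#   User-agent: *
#   Allow: /landing-page.html
#   Disallow: /
#
# /landing-page.html is a placeholder for the one HTTPS page that should
# stay crawlable. "Allow" is honored by the major engines, but not every
# crawler supports it.
```

With this in place the regular robots.txt keeps applying to the HTTP site unchanged. Blocking crawling does not remove URLs that are already indexed right away, so expect some delay before they drop out.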
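Also, why are you 302-ing https pages to http pages (if I'm understanding the post correctly)? That's not good. A 302 is a temporary redirect, so the search engines may keep the https URLs in the index and associate them with the content of the http target pages. If the move is meant to be permanent, a 301 is the usual choice.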
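If the redirects are switched to permanent ones, a minimal .htaccess sketch might look like the following. It assumes Apache 2.x with mod_rewrite, a site served on the default ports, and uses /landing-page.html as a stand-in for the one page that should remain on HTTPS; none of these details come from the original post.

```apache
# .htaccess in the document root (assumes mod_rewrite is available)
RewriteEngine On

# Apply only to requests that arrive over HTTPS.
RewriteCond %{HTTPS} on
# Leave the landing page on HTTPS (hypothetical path).
RewriteCond %{REQUEST_URI} !^/landing-page\.html$
# Send everything else to the same path over HTTP with a permanent redirect.
RewriteRule ^(.*)$ http://%{HTTP_HOST}/$1 [R=301,L]
```

Note that a crawler can only see a redirect on a URL it is allowed to fetch, so blocking the same https URLs in robots.txt would hide the 301 from it. It is probably best to pick one of the two approaches per URL rather than stacking both.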