Hi,
there are going on some strange things on my site according to Google Webmaster Tools.
There are a lot of 404s and a lot of “other” errors (Webmaster Tools does this categorization: server errors, soft 404, forbidden, not found (404), other). Google tries to crawl those URLs and gets the errors. I wonder how Google is finding this URLs?!
1) 404s - Example
Google tries to crawl
http://www.lehrerfreund.de/schule/1s/wordle/P1800
The P1800 seems to be a pagination-segment. Well the correct URL to the entry “wordle” is
http://www.lehrerfreund.de/schule/1s/wordle-interpretation/
or
http://www.lehrerfreund.de/schule/1s/wordle-interpretation/3200
Now I wonder why he is getting the wrong URL, and especially why he is getting the pagination-segment. When I look in Webmaster Tools, wherefrom the links are coming, the list looks like this:
http://www.lehrerfreund.de/schule/1s/wordle/P600
http://www.lehrerfreund.de/schule/1s/wordle/P1080
(and more)
The structure of the 404s is obvious and in most cases similar: It’s the correct template-path (schule/1s) but then the url-title is malformed (e.g. wordle instead of wordle-interpretation), followed by a pagination segment.
2) Other errors
I also get thousands of errors, where the malformed URL has the structure
in/technik/{title_permalink=schule/1s}/P100
The error-code given back is always 400.
The full URLs cannot be posted here, they break the entry, therefore 2 examples as code. I had to fill in a lot of blanks, otherwise the entry is broken:
http:// <a href="http://www.lehrerfreund.de/technik/1s/uraltkran/">http://www.lehrerfreund.de/technik/1s/uraltkran/</a> % 7b permalink= {tec_my_template_group % 7d/tell_friend % 7d
http:// <a href="http://www.lehrerfreund.de/schule/1s">http://www.lehrerfreund.de/schule/1s</a> / % 3c/ dl % 3e % 3 cp % 3e % 3ca % 20 href= / P120
As you can see there seems some EE-tag (like title_permalink) to be not parsed correctly. This is even more strange as I have removed the segment /in/ (which was my replacement for index.php a long time) with the htaccess.
I have double checked my templates and it seems like there are no syntactic errors in the templates.
I would really appreciate any idea how I could fix this.
Thanks in advance!