Web spiders can spider PHP pages.
But they won't go infinitely deep into your site (obviously), especially if you use query strings, especially if you have many similar pages.
Tips:
- Make each page have JUST ONE URL. So always be totally consistent when you refer to it. So if you have blog.php?id=1 don't also have blog.php?id=1&useless=42 or other URLs pointing to the same item. Also watch for stuff like blog.php?cat=1&id=2 and blog.php?id=2&cat=1 which are DIFFERENT URLs.
Search engines have limited resources to spider your site - ensure that they use them wisely.
- Use robots.txt to ban robots from pages that are of no interest to them. This is particularly relevant if you have something like logon.php?from=somepage.php linked from somepage.php - You don't want to double the number of pages on your site with links to logon.php with different query strings.
Normal pages to ban robots from are:
- Logon pages
- Register pages
- Contact forms
- Anything else which requires interaction and is therefore of no interest to a bot.
Send back sensible html.
Make page titles relevant and different on each page! This is really important.
Use real links, not javascript dropdowns or other malarky (Except for areas of your site which search engines are forbidden from anyway)
Make headings real headings, not <div> etc
Make your HTML well-formed and valid, where possible. Search engines can handle invalid HTML to a point, but may less resilient than browsers.
Mark