Sxooter wrote:Note however, that PostgreSQL's full text search engine is a bit more complex to implement than MySQL's, but at the same time it's more customizable as well. Just don't expect it to consist of "create full-text-index in table a (fieldb)" "select match('abc' on (field2) from tablea...." if you try it.
I'm thinking that due to this basic memory limitation, and the dependency of MySQL on memory for full test searches, I will end up moving into a full 64-bit system, no matter how much optimizing of code and restructuring of the table(s) that I do now.
So, it seems the first priority would be to look into developing a completely 64-bit 'clean' system, from hardware, OS, application code, etc.
Then, once that is up and running, put as much RAM as possible in it (maybe at least 16gb or so if it can be used) and have plenty of fast (probably RAIDed) storage available. At that point, I should probably install the latest stable versions of PostgreSQL and any needed plugins, drivers, etc.
Do you have any recommendations for tutorials about how to get this setup, web-sites or even books (books seem always out of date though)?
Remember that the database we operate is large but very simple, really. Just a few million records with about 6 or so fields, averaging about 1.5kb /record. It is basically a read-only db, with a full-text index on just 2 fields; the TITLE and CONTENT fields which will be searched for matching keyword phrases (all or any words), then the result sets are sorted by relevancy and paginated out to the user. We also let the user click a link to get the exact record itself on its own page.
We get heavy indexing from the SE bots, about 120,000 page requests per day or so. Googlebots are about 2/3's of that total. Our ACTUAL human user page-requests are less than 10% of that, but that number is growing fast. For this heavy traffic, we need a powerful server, and that will probably get worse as the site db grows. (Naturally, we want the SE bots, since they bring us our 'real' traffic.) We have blocked obvious leeches and bad-actor bots, several hundred of the majors.
Other than daily backups and weekly updates, that's about it. It's big, but much simpler than sites with a balance of writes/updates/reads, like busy blogs, and so on.
I'm ready to jump into the community if you can point me at a site or two where people have some outlines about to-dos for us MySQLers.
Appreciate your migration tips and URL suggestions.