Thanks for the response from database expert.
Sorry, I didn't give enough information. Central database is my client's database, I only have read access this time. And I don't have access to their database log, or I am not sure if I can run a database dump to dump the data to my web database. (I can ask but I am not sure if they will give.) Plus in the future, the other clients may not even give me the read access, they may just give me the xml file.
Our web database schema will be different than these clients database's schema. So we will write our own codes to read their data and populate it into our database.
Just like Google Base, we give google xml file, google use the new xml file to totally replace our old data in google base.
So I am thinking like this, if the client gives me read access like this time, I will use it to generate xml sheet, read the xml I will get the most current data. If the clients don't give me read access but give me the xml sheet instead, that will be even better. In the future, it would be better that I would give my xml dtd, ask the clients give the xml according to my dtd if they can.
So, for the given situations, use xml as the data transfer format is the right approach?
If the first step, I use xml as the data transfer. Then second step, empty my web database and repopulate my web database with the xml sheet every time after scheduled reading xml, will that be right too? The web database is small. Focus on one event etc.
I think that is how google base works.