Hi to everybody, i write to you cause i got a big problem: i got a DB, in mysql, with more than 12000 records, each record has some values as name, surname, city, zip code and address.

My task consists to find all record with the same address. Here is the biggest problem because, while city and zip code are entered with a drop down list, and so there are no possibilities to got differences, the address is entered manually in an input text.

Some examples:

main avenue, 21

main avanue 21

or , worst case: te abbreviation:

liberty street, 23

liberty st. 23

So, it depends how the users inserts the address, but the differences can be minimal and refer to the same address. So how can i compare the differences with a bit a tollerance??

I hope that someone can help me.

Thanks and sorry for my bad english 😉

    Thirst thing to do - store your data in database in the smallest chunks of information that have sense. For address it would be separate fields for street, number and flat number.
    When comparing for similar words in a street for example, there are many methods (mentioned here). Search in this forum for "similar", "compare" or "soundex" to find more info.

      I cannot divide the address information in small chucks cause i still got all address in DB and all address are one string only...

        Write a Reply...