Fifty ways to find your brother - searches
Message
From
29/11/2019 12:33:52
To
29/11/2019 01:55:44
General information
Forum: Visual FoxPro
Category: Coding, syntax and commands
Miscellaneous
Thread ID: 01672112
Message ID: 01672119
Views: 88
Hi Thomas,
Not a lot of time to look at it today, but my quick look says it's very interesting!

Here is the link: http://fox.wikis.com/wc.dll?Wiki~LevenshteinAlgorithm

Thanks,
Albert

>Google for Levenshtein in VFP and you will find my VFP-optimized code, IIRC on the FoxPro Wiki,
>but it is much better to link in one of the C versions, as VFP string mechanics are not well suited to the task.
>Best done twice: once on the exact spelling, once on a phonetically simplified/normalized form.
>
>HTH
>
>thomas
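
For readers wondering what "done twice" could look like, here is a minimal sketch in Python rather than VFP or C. It is not the code from the wiki page. The `levenshtein` function is the standard dynamic-programming edit distance; `normalize` is a hypothetical, deliberately crude phonetic simplification invented for this illustration (it is not Soundex or Metaphone).

```python
import re

def levenshtein(a, b):
    """Minimum number of single-character insertions, deletions,
    or substitutions needed to turn string a into string b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # delete ca
                               current[j - 1] + 1,       # insert cb
                               previous[j - 1] + cost))  # substitute (or match)
        previous = current
    return previous[-1]

def normalize(name):
    """Hypothetical phonetic simplification (an assumption, not a standard):
    lowercase, keep letters only, map 'ph'->'f' and 'ck'->'k',
    collapse doubled letters, drop vowels after the first character."""
    s = re.sub(r"[^a-z]", "", name.lower())
    s = s.replace("ph", "f").replace("ck", "k")
    s = re.sub(r"(.)\1+", r"\1", s)
    return s[:1] + re.sub(r"[aeiouy]", "", s[1:])

def compare_twice(a, b):
    """Return (distance on exact spelling, distance on normalized form)."""
    return levenshtein(a, b), levenshtein(normalize(a), normalize(b))

if __name__ == "__main__":
    for pair in [("Meyer", "Maier"), ("Philip", "Phillip")]:
        print(pair, compare_twice(*pair))
```

A low distance on the normalized form combined with a higher distance on the exact spelling suggests a spelling variant of the same name, which is one reason to keep both numbers instead of only one.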
>
>>Okay, I am tired just looking at your list :-)
>>
>>This is plaintiff data, relatively small (only about 50k records after 40 years in business). And we don't get the address, phone data, etc. until the file is settled (years into the process) - not that I would have wanted to add that into the mix.
>>My guess is that it would have taken a LONG time to work all that code into your searches.
>>
>>What is "edit distance"? It sounds useful (for what, I have no idea).
>>
>>Albert
>>
>>>Yes to all except for 3, 4 and 13.
>>>Not PHdBase, but 2 similar approaches
>>>Plus a few more, like calculating edit distance, as they call it today ;-)
>>>Plus doing the same on address fields
>>>Plus contact data (multiple phone, email...)
>>>Plus a rule engine which can be tweaked for certain profiles of data
>>>Plus a scoring system so you can order by the estimated "relevance" for further tweaking.
>>>
>>>It used to run regularly on 1 to 9 million data entries, linked across more than a couple of related tables,
>>>looking for duplicates, weeding/singling out family groups,
>>>target marketing.
>>>
>>>The system grew over a few years ;-)
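
The scoring idea in the list above (compare several fields, weight them, then order candidates by estimated relevance) can be sketched roughly as follows. This is an illustration under assumptions, not the system described in the post: the field names and `WEIGHTS` are hypothetical, and Python's standard `difflib.SequenceMatcher` ratio stands in for whatever per-field measure (edit distance, phonetic match, rules) the real system used.

```python
from difflib import SequenceMatcher

# Hypothetical field weights; a real rule set would be tuned per data profile.
WEIGHTS = {"name": 0.6, "address": 0.3, "phone": 0.1}

def similarity(a, b):
    """Similarity in [0, 1] after trimming and lowercasing; 1.0 means identical.
    SequenceMatcher is a stand-in here, not an edit-distance implementation."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()

def score(rec_a, rec_b):
    """Weighted average of per-field similarities.
    Fields missing or empty in either record are skipped, and the
    remaining weights are renormalized so scores stay comparable."""
    total = 0.0
    weight_sum = 0.0
    for field, weight in WEIGHTS.items():
        a, b = rec_a.get(field), rec_b.get(field)
        if a and b:
            total += weight * similarity(a, b)
            weight_sum += weight
    return total / weight_sum if weight_sum else 0.0

def rank_candidates(query, records):
    """Return (score, record) pairs, best match first."""
    scored = ((score(query, r), r) for r in records)
    return sorted(scored, key=lambda pair: pair[0], reverse=True)

if __name__ == "__main__":
    records = [
        {"name": "Jon Smith"},
        {"name": "John Smith", "phone": "555-0100"},
        {"name": "Mary Jones"},
    ]
    query = {"name": "John Smith", "phone": "555-0100"}
    for s, r in rank_candidates(query, records):
        print(round(s, 3), r)
```

Skipping missing fields matters for data like the plaintiff file mentioned earlier in the thread, where address and phone data only arrive late: a record with a name alone can still be ranked on that name.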
>>>