Plateforme Level Extreme
Abonnement
Profil corporatif
Produits & Services
Support
Légal
English
Looking for comments
Message
De
17/03/2005 11:10:09
 
 
À
17/03/2005 03:24:21
Information générale
Forum:
Visual FoxPro
Catégorie:
Applications Internet
Versions des environnements
Visual FoxPro:
VFP 8 SP1
OS:
Windows 2000 SP4
Network:
Windows 2000 Server
Database:
Visual FoxPro
Divers
Thread ID:
00996657
Message ID:
00996844
Vues:
17
Jos, it's probably just my imperfect understanding of the web site and it's operation, but I found it a bit confusing. As a simple test, I searched 'weather' under 'political parties'.

The first results page showed only 2 items - 3, and 5, and Docs Found = 1634. After hitting 'Next 10', I got a page with 8 items - 11, 12, 13, 14, 15, 16, 18, 19 and Docs found = 2576. Next 10 gave me 7 items - 21, 23, 24, 25, 26, 27, 28 - docs found still 2576. Each 'next 10' I clicked after that seemed to give me the same 'docs found' number, but never actually 10 items, and always some missing numbers.

Maybe that's because of the 'consolidate by website' checkbox, but I don't really know.

>Hi All
>
>We are at final beta testing stage of a VFP powered website which I was hoping some of you fellow VFP’ers might find a moment to give a test run and provide some feedback and even discover any last minute bugs or problems. Hopefully not too many of those left! ;)
>
>Background:
>
>The Scannery is a specialist, vertical content set, search site. It provides the ability to search the websites of specific content sets such as public companies or government websites from around the world. The target market for this website is professional researchers, analysts, journalists, investors, etc.
>
>A few technical facts and figures:
>
>
  • The site tracks over 35,000+ public companies in 120 countries, of which 27,000+ have websites. 15,000+ government websites and 1800+ political party websites in over 190 countries. And every major stock exchange in the world. Additional content sets are being created.
    >
    >
  • The entire operation is run by VFP 8 application’s which maintain the database of websites, controls and manages the web spiders, schedules indexing jobs, handles the search requests from end-users, and records search results and statistics.
    >
    >
  • The site uses a commercial third party search engine and web spider which we are busy writing up a case study for - time permitting. (Apologies in advance but we do not wish to discuss this third party application at this time until our case study has been finished).
    >
    >
  • Search requests are passed from the end-user, via FoxWeb (sorry Claude :), to our VFP programs for processing. Our programs in turn use the third party search engine via a COM interface to perform the actual search. The results are then packaged, analyzed, and formatted in VFP before being sent back to the end-user's browser.
    >
    >
  • The website spiders process over a terabyte of data every 4 months and produce a searchable index of +/-100 gigabytes across all content sets. So far there are about 20 million pages indexed from +/-44,000 websites.
    >
    >
  • In addition the site provides direct links to some 1.6 million downloadable documents in various formats like XLS, DOC, and PDF files, that have been identified on the websites that have been indexed.
    >
    >
  • Please note that some websites do not allow themselves to be indexed or cannot be indexed if they have flash animation entry points or use certain types of scripting techniques. For instance IBM does not allow their site to be indexed.
    >
    >
  • The site is in final beta and is running on a humble development server during the testing phase. Be gentle :)
    >
    >Thanks
  • Précédent
    Suivant
    Répondre
    Fil
    Voir

    Click here to load this message in the networking platform