We use cookies to provide you with the best possible website experience. This includes cookies that are necessary for the operation of the site, as well as cookies used for anonymous statistics, comfort settings, or displaying personalized content. You can decide which categories you want to allow. Please note that depending on your settings, some features of the website may not be available.

Cookie settings

These necessary cookies are required to enable the core functionality of the website. Opting out of these cookies is not possible.

cb-enable
This cookie stores the user's cookie consent status for the current domain. Expiry: 1 year.
laravel_session
Stores the session ID to recognize the user when the page reloads and to restore their login session. Expiry: 2 hours.
XSRF-TOKEN
Provides CSRF protection for forms. Expiry: 2 hours.
IZA Discussion Paper No. 15941
February 2023
On the Validity of Using Webpage Texts to Identify the Target Population of a Survey: An Application to Detect Online Platforms
Piet Daas, Wolter Hassink, Bart Klijs

published in: Journal of Official Statistics, 2024, 40 (1), 190-211

A statistical classification model was developed to identify online platform organizations based on the texts on their website. The model was subsequently used to identify all (potential) platform organizations with a website included in the Dutch Business Register. The empirical outcomes of the statistical model were plausible in terms of the words and the bimodal distribution of fitted probabilities, but the results indicated an overestimation of the number of platform organizations. Next, the external validity of the outcomes was investigated through a survey held under the organizations that were identified as a platform organization by the statistical classification model. The response by the organizations to the survey confirmed a substantial number of type-I errors. Furthermore, it revealed a positive association between the fitted probability of the text-based classification model and the organization's response to the survey question on being an online platform organization. The survey results indicated that the text-based classification model can be used to obtain a subpopulation of potential platform organizations from the entire population of businesses with a website.

Kommunikation
Mark Fallak
mark.fallak@liser.lu
+352 585-855-526
World of Labour
Olga Nottmeyer
olga.nottmeyer@liser.lu
+352 585-855-501
Netzwerkkoordination
Christina Gathmann
christina.gathmann@liser.lu

Das IZA@LISER-Netzwerk ist eine weltweite Gemeinschaft für exzellente Forschung in der Arbeitsmarktökonomie und angrenzenden Fachgebieten. Nach dem Wechsel von Bonn wird das Netzwerk nun am Luxembourg Institute of Socio-Economic Research (LISER) koordiniert.

Über das IZA@LISER Network
Contact
IZA Network (Current Site Operator):

Luxembourg Institute of Socio-Economic Research (LISER)
11, Porte des Sciences
Maison des Sciences Humaines
L-4366 Esch-sur-Alzette / Belval, Luxembourg

IZA Institute (In Liquidation):

Forschungsinstitut zur Zukunft der Arbeit GmbH i. L.
Schaumburg-Lippe-Str. 5-9, 53113 Bonn. Germany
Phone: +49 228 3894-0 | Fax: +49 228 3894-510
E-Mail: info@iza.org | Web: www.iza.org
Represented by: Martin T. Clemens (Liquidator)