Data Big and Small

Where do restaurants come from?

How do digital platforms affect the concrete functioning of markets that pre-existed them? Platforms are intermediaries and it was initially thought that they could solve any mismatches between supply and demand. In the restaurant sector, the hope was that they would seamlessly connect diners with available tables and help restaurants fill their rooms. Yet traditional booking methods remain, and many restaurants restrict the number of seats offered through platforms. A recent study, which I have just co-published with Elise Penalva Icher and Fabien Eloire, examines why.

We borrow Harrison White’s famous producer market model, based on the idea that the key problem of a firm is to position itself in a market that consists of differentiated niches. Restaurants are not homogeneous, and they continuously scan the market to fine-tune their offer – from fine dining to bistro and pizzeria. They evaluate two main indicators: volume, which is relatively straightforward, and quality, which is harder to gauge as it depends on subjective customer perceptions. Platforms break through this limitation by publishing consumers’ reviews and aggregating them into ratings. They provide “digital glasses” that reveal quality alongside volume.

The study investigates dine-in services in Lille, France, in the case of a widely adopted booking and review platform. Methods include participant observation, interviews, web-scraping, and quantitative analysis of business data.

Lille restaurants in Harrison White’s model plot. Note: Horizontal axis: volume, vertical axis: quality. The sub-axes distinguish a non-viable (“Failure”) region from a viable one, in turn subdivided into three different regions (“Ordinary”, “Advanced” and “Paradox”). Zone A = Paradox, zone C = Ordinary, zone D = Advanced, all other zones = Failure. N = 105.

Findings highlight three key effects. First, an amplification effect: platforms enable restaurants to see “like a market,” not just through their own customers but also through competitors’ clients. Second, a normalization effect: platform use pushes firms to standardize their offers, fostering similarity without complete homogenization. Third, a duration effect: sustained platform participation depends on quality positioning, although many restaurants exit after a few years, partly in response to platform dominance. These dynamics suggest a broader rationalization process in which platforms make market observation more systematic and efficient.

This perspective nuances common claims about platforms as market “revolutions.” The study finds no evidence that platforms improve consumer–producer matching. None of the interviewed restaurateurs feared empty tables, and some deliberately withheld capacity from the platform to accommodate walk-ins or phone bookings. Overemphasizing intermediation, earlier research may have overlooked subtler effects. The key function of platforms does not always have to be matching. They can play diverse and even unbalanced roles on a single side of the market, without striving toward a competitive supply-demand equilibrium.

The analysis also reaffirms the validity of White’s model. Originally designed for settings where firms observed only volumes, the model still applies when platforms disclose quality through reviews. Its insights hold across different technological contexts.

Finally, the study underscores the limits of using platforms as sources of research data. We relied on platform data, but we faced gaps: available data are partial because platform objectives differ from research needs, and algorithms remain proprietary. This raises concerns, as platforms exert broad societal influence while controlling critical information.

Overall, the research advances understanding of how platforms affect business practices, in this case restaurants. It contributes to critical scholarship that recognizes the novelty of platform intermediation while tempering claims about its benefits.

The study is available in open access here.

A successful INDL-8 conference in Bologna

When we created ENDL (the European Network on Digital Labour), back in 2017, we booked a room with 17 places. A few days ago, the last conference of the network (which in the meantime has become INDL, replacing ‘European’ with ‘International’) hosted about 200 participants. Internationalisation has not only meant numerical growth, but also inclusion of a diverse range of voices: every year, we see more participants from countries that are often under-represented on the scientific scene, from India and South Africa to Argentina and Brazil. Participants have also diversified in another sense, too: if the majority have always been academics, it is a pleasure to see more and more workers, as well as labour organisers. This year, we could for example benefit from the presence of associations of data workers from Kenya, freelancers from France, and content moderators from Spain.

Participants to the INDL-8 conference, Saint-Cristina cloister, Bologna, IT, 10 September 2025.

A conference like this one is meant to give hope – hope of mutual understanding across countries and cultures, hope of dialogue across disciplines and fields, hope of connections between academic research and action. We worked together to ensure a welcoming environment for all, for instance by encouraging constructive comments, rather than sheer criticism, after each paper presentation. We also strived to keep costs down in order to make the conference free of charge, and with the DiPLab research programme, we could give a few small scholarships to promising presenters who might not have been able to travel otherwise.

Two speakers (M Francesco Sinopoli, Fondazione Di Vittorio, and Ms Kauna Malgwi, Uniglobal) at the plenary panel ‘Plenary panel: New Unionism, towards global alliances’, part of the INDL-8 Conference, DAMA Tecnopolo, Bologna, IT, 11 September 2025

Surely, problems remain. A couple potential participants had visa issues, while others had to cancel due to lack of funding. These problems weigh especially hard on people from emerging and lower-income countries outside Europe and North America. The future is also uncertain, as funding sources become increasingly dryer, and visa restrictions tighter. For this reason, the main INDL-9 conference next year (Geneva, ILO, 9-11 September 2026) will be accompanied by the growth of local chapters. The Middle-East and Africa area is preparing its second conference, this time online only, on 25-26 November. In the US, a one-day event will take place at Yale University on 29 April 2026. Colleagues in Chile and Argentina are launching a series of online events.

Closing keynote (Prof. Sandro Mezzadra, chair: Prof. Marco Marrone), Saint-Cristina Aula Magna, Bologna, IT, 12 September 2025

More information on the INDL-8 conference (including the full programme) is available here.

Women in the loop: the gendered contribution of data workers to AI

I presented today, at the WORK2025 conference in Turku, Finland, a paper on the human-in-the-loop systems that integrate human labor into the production of Artificial Intelligence (AI). Beyond engineers who design models, myriad “data workers” prepare training data, verify outputs, and correct errors. Their role is crucial but undervalued, with low pay and poor working conditions. Shaped by outsourcing and offshoring practices, the market for such services has grown steadily over time, with digital platforms acting as the main intermediaries between AI producers and workers. In their communication with clients, these platforms often emphasize that human workers provide nuanced judgment in complex tasks.

The three main functions of micro-work in the development of data-intensive, machine-learning based AI solutions. Source: https://doi.org/10.1177/2053951720919776

But who are the humans in the loop, and whose contributions count? Here, I focus on women’s participation and its evolution as the market expanded. Data work is theoretically well-suited for women, since it can be performed remotely from home. Besides, platforms generally do not share gender information, thereby limiting direct discrimination. One might thus expect women’s representation to be high. However, the statistical evidence is mixed. Across studies, the proportion of women data workers exceeds 50% only in four cases. Besides, reports sometimes differ for the same country, across platforms or at different moments in time. Looking at the lowest reported shares, then in no country except the US do women represent more than 40% of all data workers. Even in the US, recent data indicate that women constitute about half of the data workforce, down from 57-58% some years ago. Why are women underrepresented, and why does the pattern vary across countries?

Highest proportion of women data workers reported in existing studies (incl. own datasets). Source: author’s elaboration, created with MapChart.

Lowest proportion of women data workers reported in existing studies (incl. own datasets). Source: author’s elaboration, created with MapChart.

The earliest explanation comes from P. Ipeirotis (2010), who analyzed Amazon Mechanical Turk, then the dominant platform. Most workers were from the US and India. In the US, data work paid too little to sustain a household and was often taken up by un- and under-employed women seeking supplementary income. In India, dollar-based pay was more attractive and often a main household income, drawing more men into the activity. Later, as the market expanded, this explanation appeared insufficient: the above maps show that not all rich countries have many female data workers, and some lower-income countries do. Yet, my data show a negative correlation: the larger the share of workers for whom data work is the main income source, the smaller the proportion of women. Ipeirotis’s hypothesis still holds but requires updating to today’s more competitive and globalized platform economy.

Proportion of workers for whom data work is the main source of income vs. proportion of women, by country. Source: own survey data (from projects TRIA and ENCORE, 2020-24).

Platforms fragment work into tasks and assign them to individuals framed as independent contractors competing for access. Unlike traditional firms, workers do not collaborate but face intense competition. Outcomes vary by national context. In countries facing stagnation or crisis, such as Venezuela, international platforms offer a rare source of income for highly qualified workers. Competition becomes fierce, and “elite” workers – often young men with STEM backgrounds – dominate. Women are disadvantaged, either due to fewer technical qualifications or because care responsibilities limit their ability to invest in building strong platform profiles and reputations. By contrast, in more dynamic economies such as Brazil, local job markets absorb highly skilled professionals, leaving platform work to more disadvantaged groups. Here, women with family duties are more visible. Thus, platform demographics reflect national conditions: in poorer or crisis-stricken countries, men from the educational elite seek career advancement, while in richer countries, women (especially mothers) take on such work primarily to supplement household income. Women may be equally educated, but they often lack the time to cultivate advanced STEM skills. As platforms demand longer and more specialized tasks, men increasingly gain the upper hand, crowding women out—even in countries where they were once the majority.

Platform design ignores these dynamics. Workers are treated as abstract entities, stripped of the socio-economic and cultural contexts that shape real inequalities. Competition, combined with local conditions, deepens gender gaps. Interventions must therefore consider gender disparities. Otherwise, they risk reinforcing inequalities. Supporting women’s access to data work—particularly those constrained by family responsibilities—can contribute to more balanced labor participation and ensure that AI benefits from a broader diversity of human input.

AI, labour and natural resources in Santiago

Last week in Santiago, Chile, I had the tremendous opportunity to give a keynote speech at the 4th annual workshop of the Millennium Nucleus on the Evolution of Work (M-NEW), of which I am also a senior international member. This interdisciplinary workshop brought together labour scholars from various parts of Latin America and beyond. I really liked the inspiring talks and the friendly and stimulating interactions with colleagues.

My own talk drew on my multi-year research programme on the crucial yet invisible human labor behind the global production of artificial intelligence. I first examined the evolution of this form of work over the last two decades, demonstrating that while its core functions in the development of smart systems have remained consistent, the scope and volume of such tasks have expanded significantly. I then analyzed the organization of this labour at the intersection of three trends in recent globalization: outsourcing, offshoring, and digitalization. These dynamics account for the marginalization of these workers within the tech industry and the relocation of their labor to lower-wage countries. Based on these insights, I described four cases—Venezuela, Argentina, Brazil, and Chile—highlighting the diverse effects of local conditions. I concluded by identifying emerging scientific and policy challenges, particularly concerning the recognition of skills, and the place of the informal economy.

The following week, still in Santiago, I was excited to participate in the kick-off meeting of the new research project SEED (“Social and Environmental Effects of Data connectivity: Hybrid ecologies of transoceanic cables and data centers in Chile and France”), a collaboration between my research group DiPLab and another Millennium Nucleus, FAIR (“Futures of Artificial Intelligence Research”). SEED received joint funding from the ECOS-SUD programme (France) and ANID (Chile) to analyse the AI value chain, from its production and development to its impact on employment, use and environmental consequences, by studying the Valparaíso-Santiago de Chile and Marseille-Paris axes.

My presentation introduced the concept of the ‘dual footprint’ as a heuristic device to capture the commonalities and interdependencies between the different impacts of AI on the natural and social surroundings that supply resources for its production and use. I framed the AI industry as a value chain that spans national boundaries and perpetuates inherited global inequalities. The countries that drive AI development generate a massive demand for inputs and trigger social costs that, through the value chain, largely fall on more peripheral actors. The arrangements in place distribute the costs and benefits of AI unequally, resulting in unsustainable practices and preventing the upward mobility of more disadvantaged countries. The dual footprint grasps how the environmental and social dimensions of AI emanate from similar underlying socio-economic processes and geographical trajectories.

Sunbelt 2025 in Paris: inequalities, weak ties, and networked markets

I am attending a faboulous Sunbelt conference, taking place this year in Paris. I am deeply grateful to Emmanuel Lazega who made gigantic efforts to bring this important conference to France, after a failed attempt in 2020 when the Covid-19 pandemic brought everything online.

Yesterday, a very inspiring keynote by Beate Völker reminded us of the importance of weak ties and even absent ties – not only to smoothen the functioning of job markets but also, more surprisingly, to achieve social cohesion. The keynote took place in the historical Grand Amphithéâtre de la Sorbonne.

I am pleased to have contributed to a set of initiatives in honour of great network sociologist Harrison White, one year after his death. With Elise Penalva-Icher and Fabien Eloire, we presented a paper on digital platforms in White-like producer markets shaped by networks (more on it soon!). The paper was part of a dedicated session on the legacy of Harrison White. There was also a plenary in memoriam, where his former students and friends shared thoughts and stories. (Another plenary honoured Barry Wellman).

I was also honoured to be invited, today, to join a plenary panel on social networks and the study of social inequalities, organized by Gianluca Manzo. While most research on inequalities is attribute-based, social network approaches provide a powerful alternative (or perhaps, complement), highlighting how interpersonal, relational mechanisms generate patterns that over time, lock categorical differences into durable gaps in wealth, status, or other outcomes. We discussed complementarities and differences between these two approaches, the advantages of a network-oriented perspective, but also the methodological challenges that come with it.

On Sunday, I’ll present a paper that also deals with inequalities, in the specific case of online platform workers. We define an index of ‘vulnerability’ to unveil inequalities within this worker population in two countries, France and Spain. The paper develops and deeps results of a previous work, whose first outputs served to inform policy decisions in France.

Another paper to which I have contributed, and which will be presented at this Sunbelt, is more methodological and is the result of a collective effort. We analyze over 20 years of publications in the journal REDES and highlight how researchers have reported relational data from both personal and complete networks. We identify key challenges in the consistency and transparency of reporting and propose 7 practical recommendations to improve clarity, comparability, and replicability in social network research. This paper is already published in REDES.

Desinformación, trabajo y plataformas en Buenos Aires

Acabo de regresar de un viaje muy lindo a Argentina, donde fui invitada por el Instituto francés para participar en varios eventos.

El 12 de mayo, participé en la conferencia “Manipulación Informativa e Injerencia Extranjera: Desafíos Globales y Respuestas Democráticas”, organizada por la Delegación de la Unión Europea en Argentina y por varias embajadas (como la de Francia). En el panel “Cómo contrarrestar la desinformación respetando la libertad de expresión y el derecho a la información”, hablé de cómo la desinformación se financia a través del mercado publicitario que sustenta toda la web, y de la necesidad de regular este mercado. También destaqué la importancia de fortalecer la investigación científica sobre este tema – como en el proyecto AI4TRUST, financiado por la propia UE.

Al día siguiente, tuve dos encuentros con estudiantes de periodismo, en la Universidad Nacional de Avellaneda y en la Universidad Abierta Interamericana, también sobre temas relacionados con la desinformación en internet. Fue un honor y un placer ver, en cada caso, la sala llena y mucho interés. Los/as estudiantes hicieron muchas preguntas y demostraron una gran disposición a aprender y progresar.

Los días 16 y 17 de mayo participé en tres paneles organizados en el marco de la Noche de las Ideas, una iniciativa del Instituto francés que se organiza todos los años y esta vez tuvo lugar en el famoso Teatro Colón de Buenos Aires. Fui en la sesión inaugural sobre el tema de este año, “El poder de actuar”. Además, participé como ponente en un debate muy interesante sobre el trabajo en plataformas digitales que se titulaba provocativamente ¿Nuevas servidumbres voluntarias? Jóvenes y precariedad y en otro sobre la inteligencia artificial: “In A.I. we trust?”. Actuar con y en contra de las nuevas tecnologías.

El 20 de mayo, di una charla sobre “El futuro del trabajo y la IA” como parte del ciclo UBA Digital, en la Universidad de Buenos Aires. Presenté algunos resultados de mi investigación sobre el trabajo digital y su papel en la producción de IA, desarrollada en el marco del programa de investigación DiPLab. Una vez más, me alegró ver a muchos participantes con preguntas muy interesantes. Fuimos acogidos por la facultad de odontología y también tuvimos la extraordinaria oportunidad de visitar la clínica.

Uninvited protagonists: the social networks of Venezuelan AI data workers

After years of work, the long-awaited good news: my article ‘Uninvited Protagonists: The Networked Agency of Venezuelan Platform Data Workers‘, co-authored with Juana Torres-Cierpe, has just been published in New Technology, Work and Employment!

Workers in Venezuela are powering AI production, often under tough conditions. Sanctions and a deep political-economic crisis have pushed them to work for platforms that pay in US dollars, albeit at low rates. They constitute a large reservoir for technology producers from rich countries. But they are not passive players.

They build resilience, rework their environment, and sometimes engage in acts of resistance, with support from different segments of their personal networks. From strong local ties to loose online connections, these informal webs help them cope, adapt, and occasionally push back. Their diversified relationships comprise an unofficial and often hidden, albeit largely digitised relational infrastructure that sustains their work and shapes collective action.

These findings invite to rethink agency as embedded in workers’ personal networks. To respond to adversities, one must liaise with equally affected peers, with family and friends who offer support, etc. Social ties ultimately determine who is enabled to respond, and who is not; whether any benefits and costs are shared, and with whom; whether any solution will be conflictual or peaceful. Social networks are not accessory but constitute the very channel through which Venezuelan data workers cope with hardship.

Not all relationships play the same role, though. Venezuelans discover online data work through their strong ties with family, close friends, and neighbours. To convert their online earnings into local currency, they rely on their broader social networks of relatives and friends living abroad and indirect relationships with intermediaries. For managing their day-to-day activities, Venezuelans expand their social networks through online services like Facebook, WhatsApp, and Telegram, connecting with diverse and less-close peers within and outside the country. Different social ties affect the various stages of the data working experience.

Overall, no Venezuelan could work alone – and the networked interactions that sustain each of them against hardship have made them massively present, as ‘uninvited protagonists,’ in international platforms. Their massive presence in the planetary data-tasking market is a supply rather than demand-driven phenomenon.

This analysis also sheds light on the reasons why mobilisation is uncommon among platform data workers. Other studies noted diverging orientations of workers, unclear goals, lack of focus, and insufficient leadership. Another powerful reason hinges upon the predominance of weak ties in building up online group membership: indeed, distant acquaintances are insufficient to prompt people to action if their intrinsic motivations are low.

The article is available in open access here.

Call for Abstracts: INDL-8 Conference in Bologna

It is my pleasure to announce that the call for abstracts for the upcoming INDL-8 conference is now open.

The conference reaches Italy this year. It will take place in the most ancient University in the western world, Bologna, on 10-12 September 2025.

The overarching topic of this year’s conference is ‘Contesting Digital Labor: Resistance, counteruses, and new directions for research’. The goal is to explore how platform workers navigate, challenge, and reshape algorithmic management systems while forging innovative forms of solidarity and collective action. We also aim to explore the perspectives that technological developments open for workers in order to escape everyday surveillance, to resist top-down control and to organise to defend their rights.

In addition to presentations that directly address these questions, we welcome proposals that analyse a broader range of issues related to digital labour.

To read the full Call and submit your abstract, please visit the conference management website.

The deadline is 27 April 2025.

NB: A small number of scholarships to partly cover travel/subsistence costs will be made available – stay tuned for more information.

NB2: The keynotes and plenaries will be announced very soon.

Please feel free to share with any scholars and postgraduate students who might be interested.

The digital labour of AI in Latin America

Another article has just been published! Another one that is based on a DiPLab-based group collaboration (with A.A. Casilli, M. Fernández Massi, J. Longo, J. Torres Cierpe and M. Viana Braz) and that uses data from multiple countries. It is entitled ‘The digital labour of artificial intelligence in Latin America: a comparison of Argentina, Brazil, and Venezuela’ and is part of a special issue of Globalizations on ‘The Political Economy of AI in Latin America’. This article lifts the veil on the precarious and low-paid data workers who, from Latin America, engage in AI preparation, verification, and impersonation, often for foreign technology producers. Focusing on three countries (Argentina, Brazil, and Venezuela), we use original mixed-method data to compare and contrast these cases in order to reveal common patterns and expose the specificities that distinguish the region.

The analysis unveils the central place of Latin America in the provision of data work. To bring costs down, AI production thrives on countries’ economic hardship and inequalities. In Venezuela and to a lesser extent Argentina, acute economic crisis fuels competition and favours the emergence of ‘elite’ (young and STEM-educated) data workers, while in more stable but very unequal Brazil, this activity is left to relatively underprivileged segments of the workforce. AI data work also redefines these inequalities insofar as, in all three countries, it blends with the historically prevalent informal economy, with workers frequently shifting between the two. There are spillovers into other sectors, with variations depending on country and context, which tie informality to inequality.

Our study has policy implications at global and local levels. Globally, it calls for more attention to the conditions of AI production, especially workers’ rights and pay. Locally, it advocates solutions for the recognition of skills and experience of data workers, in ways that may support their further professional development and trajectories, possibly also facilitating some initial forms of worker organization.

The version of record is here, while an open-access preprint is available here.

Where does AI come from?

I am thrilled to announce that an important article has just seen the light. Entitled ‘Where does AI come from? A global case study across Europe, Africa, and Latin America’, it is part of a special issue of New Political Economy on ‘Power relations in the digital economy‘. It is the result of joint work that I have done with members of the Diplab team (A.A. Casilli, M. Cornet, C. Le Ludec and J. Torres Cierpe) on the organisational and geographical forces underpinning the supply chains of artificial intelligence (AI). Where and how do AI producers recruit workers to perform data annotation and other essential, albeit lower-level supporting tasks to feed machine-learning algorithms? The literature reports a variety of organisational forms, but the reasons of these differences and the ways data work dovetails with local economies have remained for long under-researched. This article does precisely this, clarifying the structure and organisation of these supply chains, and highlighting their impacts on labour conditions and remunerations.

Framing AI as an instance of the outsourcing and offshoring trends already observed in other globalised industries, we conduct a global case study of the digitally enabled organisation of data work in France, Madagascar, and Venezuela. We show that the AI supply chains procure data work via a mix of arm’s length contracts through marketplace-like platforms, and of embedded firm-like structures that offer greater stability but less flexibility, with multiple intermediate arrangements that give different roles to platforms. Each solution suits specific types and purposes of data work in AI preparation, verification, and impersonation. While all forms reproduce well-known patterns of exclusion that harm externalised workers especially in the Global South, disadvantage manifests unevenly depending on the structure of the supply chains, with repercussions on remunerations, job security, and working conditions.

Marketplace- and firm-like platforms in the supply chains for data work in Europe, Africa, and Latin America. Dark grey countries: main case studies, light grey countries: comparison cases. Organisational modes range from almost totally marketplace oriented (darker rectangle, Venezuela) to almost entirely firm oriented (lighter rectangle, Madagascar). AI preparation (darker circle) is ubiquitous, but AI verification (darker triangle) and AI impersonation (darker star) tend to happen in ‘deep labour’ and firm-like organisations where embeddedness is higher.

We conclude that responses based only on worker reclassification, as attempted in some countries especially in the Global North, are insufficient. Rather, we advocate a policy mix at both national and supra-national levels, also including appropriate regulation of technology and innovation, and promotion of suitable strategies for economic development.

The Version of record is here, while here is an open access preprint.