paolatubaro

Brazil in the global AI supply chains: the role of micro-workers

AI is not just a Silicon Valley dream. It relies, among other things, on inputs from human workers who generate and annotate data for machine learning. They record their voice to augment speech datasets, transcribe receipts to provide examples to OCR software, tag objects in photographs to train computer vision algorithms, and so on. They also check algorithmic outputs, for example, by noting whether the outputs of a search engine meet users’ queries. Occasionally, they take the place of failing automation, for example when content moderation software is not subtle enough to distinguish whether some image or video is appropriate. AI producers outsource these so-called “micro-tasks” via international digital labor platforms, which often recruit workers in Global-South countries, where labor costs are lower. Pay is by piecework, without any long-term commitment and without any social-security scheme or labor protection.

In a just-published report co-authored with Matheus Viana Braz and Antonio A. Casilli, as part of the research program DiPLab, we lifted the curtain on micro-workers in Brazil, a country with a huge, growing, and yet largely unexplored reservoir of AI workers.

We found among other things that:

Three out of five Brazilian data workers are women, while in most other previously-surveyed countries, women are a minority (one in three or less in ILO data).
9 reais (1.73 euros) per hour is the average amount earned on platforms.
There are at least 54 micro-working platforms operating in Brazil.
One third of Brazilian micro-workers have no other source of income, and depend on microworking platforms for subsistence.
Two out of five Brazilian data workers are (apart from this activity) unemployed, without professional activity, or in informality. In Brazil, platform microwork arises out of widespread unemployment and informalization of work.
Three out of five data workers have completed undergraduate education, although they mostly do repetitive and unchallenging online data tasks, suggesting some form of skill mismatch.
The worst microtasks involve moderation of violent and pornographic contents on social media, as well as data training in tasks that workers may find uncomfortable or weird, such as taking pictures of dog poop in domestic environments to train data for “vacuuming robots”.
Workers’ main grievances are linked to uncertainty, lack of transparency, job insecurity, fatigue and lack of social interaction on platforms.

To read the report in English, click here.

To read the report in Portuguese, click here.

Research ethics in the age of digital platforms

I am thrilled to announce the (open access) publication of ‘Research ethics in the age of digital platforms’ in Science and Engineering Ethics, co-authored with José Luis Molina, Antonio A. Casilli & Antonio Santos Ortega.

We examine the implications of the use of digital micro-working platforms for scientific research. Although these platforms offer ways to make a living or to earn extra income, micro-workers lack fundamental labour rights and ‘decent’ working conditions, especially in the Global South. We argue that scientific research currently fails to treat micro-workers in the same way as in-person human participants, producing de facto a double morality: one applied to people with rights acknowledged by states and international bodies (e.g. Helsinki Declaration), the other to ‘guest workers of digital autocracies’ who have almost no rights at all.

Doctoral thesis: Dynamics of collective elaboration of (in)appropriate information in social networks

We’re hiring!

As part of a large, interdisciplinary European research project, we are seeking a motivated, open-minded student to join CNRS (specifically, the Centre for Research in Economics and Statistics, CREST) in Palaiseau, France, for three years.

The thesis aims to model the production and dissemination of ‘fake news’ in situations of uncertainty and socio-economic inequality. A rich sociological literature suggests that actors contextualise messages received and emitted as questions or answers, interpret them according to their recipients and senders, and assess their social acceptability within their own networks of relationships, taking into account their relative position. Building on this research, the goal is to identify the social processes underpinning misinformation-generating digital communications: collective identity, inequalities of status or authority, hierarchy of shared norms. This will make it possible to interpret the online social interactions through which actors collectively judge the (appropriate or inappropriate) quality of a message or information and then decide whether to relay or share it – and with whom. In particular, the thesis work will contribute to: 1/ drawing up a state of the art, mainly within sociology but open to the neighbouring disciplines which have also addressed these questions; 2/ illustrating and testing these theories through an empirical analysis of a digital database, mainly with quantitative methods, which may be enriched through a small complementary qualitative fieldwork; 3/ preparing guidelines that help information professionals and policy-makers to detect the sources and modalities of emergence and propagation of misinformation.

The thesis will be done within the framework of the interdisciplinary project “AI-based-technologies for trustworthy solutions against disinformation” (AI4TRUST), funded by the European Union over the period 2023-2026, involving 17 partners (research institutions and media professionals) in 10 countries, and coordinated by Fondazione Bruno Kessler (Italy).

The AI4TRUST project aims to build a hybrid system, with advanced artificial intelligence solutions capable of cooperating with humans in the fight against disinformation. The new algorithms that will be developed in this framework, constantly checked and improved by human fact-checkers, will monitor multiple online social platforms in nearly real time, analysing text, audio, and visual contents in several languages. The resulting quantitative indicators, including infodemic risk, will be inspected under the lens of social and computational social sciences, to build the trustworthy elements required by media professionals.

CNRS contributes to the study of the sociological dimension of these issues, and participates in the project through its laboratories Centre Marc Bloch (CMB, Berlin), Centre de Sociologie des Organisations (CSO, Paris) and Centre de Recherche en Economie et Statistique (CREST, Palaiseau). In practice, the thesis will be carried out at CREST, and co-directed by representatives of the three laboratories involved in AI4TRUST: myself, Emmanuel Lazega (CSO) and Camille Roth (CMB).

The successful candidate will have the opportunity to join a group of highly motivated scientists and practitioners from across the continent; to participate in collaborations with other teams working on the project in an interdisciplinary framework; to attend regular meetings with the project’s principal investigator, the scientists and experts involved, and public decision-makers; and to present and publish research results in international conferences and journals.

The ideal candidate has a good background in quantitative sociology or in a STEM discipline (e.g., mathematics, statistics, computer science) with a strong interest in societal issues and challenges. A very good knowledge of English, an interdisciplinary approach and the ability to work in teams are essential.

Candidates should apply on the CNRS portal, where they will also find more details.

INDL-6 Conference: CfP now open

We are excited to announce the 6th Conference of the International Network on Digital Labor (INDL-6), scheduled to take place 9-11 October 2023. The conference aims to bring together experts from various fields to discuss the latest research findings and share ideas on the topic of Digital Labor in the Wake of Pandemic Times. Following long-term technological trends as well as exogenous shocks, the field of digital labor is constantly expanding. This year’s INDL conference will be an excellent opportunity to exchange insights and perspectives, as well as a great way to make new friends among researchers, workers, policymakers, and practitioners who study the future of work, social justice, platforms, and artificial intelligence (AI).

The INDL-6 conference will be held in-person at the Weizenbaum Institute for the Networked Society in Berlin, Germany. It is co-organized by the International Labour Organization (ILO), the Digital Platform Labor (DiPLab) group, and Wissenschaftszentrum Berlin für Sozialforschung (WZB).

We encourage all interested researchers, post-graduate students, and practitioners to submit proposals that address aspects of digital labor, including but not limited to: gig economy, online labor, workplace surveillance, algorithmic management, AI-assisted recruiting, remote work, employee well-being, inequality, policy responses to the Covid-19 crisis, regulation, organizing digital workers, gender and work, LGBTQ+ workers, intersectionality, disability, inclusion, AI, decolonial lens, informal labor markets, generative AI and work.

We welcome submissions that are interdisciplinary in nature and strongly encourage proposals by researchers and practitioners from the Global South across all topics.

The Call for Papers is available here and the deadline is 12 April.

How much does a face cost?

Three to five dollars: that’s the answer. As simple as that. I am talking about the behind-the-curtain market for personal data that sustains machine learning technologies, specifically for the development of face recognition algorithms. To train their models, tech companies routinely buy selfies as well as pictures or videos of ID documents from little-paid micro-workers, mostly from lower-income countries such as Venezuela and the Philippines.

Josephine Lulamae of AlgorithmWatch interviewed me for a comprehensive report on the matter. She shows how, in this globalized market, the rights of workers are hardly respected – both in terms of labour rights and of data protection provisions.

I saw many such cases in my research over the last two years, as I interviewed people in Venezuela who do micro-tasks on international digital platforms for a living. Their country is affected by a terrible economic and political crisis, with skyrocketing inflation, scarcity of even basic goods, and high emigration. Under these conditions, international platforms – which pay little, but in hard currency – have seen a massive inflow of Venezuelans since about 2017-18.

Some of the people I interviewed just could not afford to refuse a task that paid five dollars – at a moment in which the monthly minimum wage of Venezuela was plummeting to as little as three dollars. They do tasks that workers in richer countries such as Germany and the USA refuse to do, according to Lulamae’s report. Still, even the Venezuelans did not always feel comfortable doing tasks that involved providing personal data such as photos of themselves. One man told me that before, in better conditions, he would not have done such a task. Another interviewee told me that in an online forum, there were discussions about someone who had agreed to upload some selfies and later found his face in an advertisement on some website, and had to fight hard to get it removed. I had no means to fact-check whether this story was true, but the very fact that it circulated among workers is a clear sign that they worry about these matters.

On these platforms that operate globally, personal data protection does not work very well. This does not mean that clients openly violate the law: for example, workers told me they had to sign consent forms, as prescribed in the European General Data Protection Regulation (GDPR). However, people who live outside of Europe are less familiar with this legislation (and sometimes, with data protection principles more generally), and some of my interviewees did not fully understand the consent forms. More importantly, they have few means to contact clients, who typically avoid revealing their full identity on micro-working platforms – and therefore can hardly exercise their rights under GDPR (right to access, to rectification, to erasure etc.).

The rights granted by GDPR are comprehensive, but do not include property rights. The European legislator did not create a framework in which personal data can be sold and bought, and rather opted for guaranteeing inalienable rights to each and every citizen. However, this market exists and is flourishing, to the extent that it is serving the development of state-of-the-art technologies. Its existence is problematic, like the ‘repugnant’ markets for, say, human organs or babies for adoption, where moral arguments effectively counter economic interest. It is a market that thrives on global inequalities, and is a reminder of the high price to pay for today’s technical progress.

See the full report here.

Series of talks in Chile on artificial intelligence, work and social networks

I am very excited and happy to start a series of talks in Chile, mainly in Santiago and Talca, with Antonio A. Casilli this January. I am very grateful to the French Embassy in Chile, the Instituto Francés de Chile, and the Fundación Teatro a Mil for this wonderful opportunity. Thanks also to Juana Torres Cierpe and Francisca Ortiz Ruiz for their help in reaching out to colleagues, friends and students in Chile.

We will begin with a talk entitled “Digital platforms, online work and automation after the health crisis”, which will take place on Monday 16 January at 12:00 at the CUT headquarters (1 oriente # 809, Talca). In this talk, we will present our research on the highly precarious micro-work that takes place on digital platforms. Many thanks to Professor Claudia Jordana Contreras and to the School of Sociology of the Universidad Católica del Maule for organizing this event.

On Tuesday 17 January 2023, at 11:00, I will speak about “Artificial intelligence, labor transformations and inequalities: Women’s work on digital ‘micro-task’ platforms” at the Instituto de Sociología of the Universidad Católica, with the Quantitative and Computational Social Science Research Group. Thanks to Mauricio Bucca, who organized this event. We will be at the Pontificia Universidad Católica de Chile, Campus San Joaquín.

On Tuesday 17 in the afternoon (at 17:00), I will speak about “Ethics of artificial intelligence and other challenges for social network research” as part of the Summer School of the Centro de Investigación en Complejidad Social, Universidad del Desarrollo. I thank Jorge Fábrega Lacoa and his colleagues for organizing it.

Also on Tuesday 17, at 10:00, Antonio Casilli will give a talk at the Congreso Futuro event: “Global labor and artificial intelligence. The ‘human ingredients’ of automation” (Teatro Oriente, Pedro de Valdivia 099, Providencia).

On Friday 20 January 2023, at 10:00, Antonio and I will speak together about “The labor behind artificial intelligence and automation in Latin America” at an international workshop organized by the Universidad de Chile, with Pablo Pérez (thanks for organizing!) and Francisca Gutiérrez, room 129, FASCO, Av. Ignacio Carrera Pinto 1045, Ñuñoa.

This is followed by an event organized by the Instituto Francés, “La noche de las ideas” (Night of Ideas):

Friday 20 January 2023, 20:00, Centro Cultural La Moneda, Noche de las Ideas, Santiago: Paola Tubaro, “Automation: The end of the human?” (with Denis Parra and Javier Ibacache, Plaza de la Ciudadanía 26, Santiago).

Saturday 21 January 2023, 16:00, Centro Cultural La Moneda, Noche de las Ideas, Santiago: Antonio Casilli, “What is artificial intelligence hiding?” (with José Ulloa, Constanza Michelson and Paula Escobar, Plaza de la Ciudadanía 26, Santiago).

On Wednesday 26 January 2023, at 18:30 in Santiago, Antonio Casilli’s book “Esperando a los robots. Investigación sobre el trabajo del clic” (LOM, 2021) will be presented (with Paulo Slachevsky, Librería del Ulises Lastarria, José Victorino Lastarria 70, local 2, Paseo Barrio Lastarria).

All events are free of charge. For the Noche de las Ideas and the Congreso Futuro, online registration is required.

Artificial Intelligence and Globalization: Data Labor and Linguistic Specificities (AIGLe)

We organized the one-day conference AIGLe on 27 October 2022 to present the outcomes of interdisciplinary research conducted by our DiPLab teams in French-speaking African countries (ANR HUSH Project) and Spanish-speaking countries in Latin America (CNRS-MSH TRIA Project). Both initiatives study the human labor necessary to generate and annotate the data needed to produce artificial intelligence, to check outputs, and to intervene in real time when algorithms fail. Researchers from economics, sociology, computer science, and linguistics shared exciting new results and discussed them with the audience.

AIGLe is part of the project HUSH (The HUman Supply cHain behind smart technologies, 2020-2024), funded by ANR, and the research project TRIA (The Work of Artificial Intelligence, 2020-2022), co-financed by the CNRS and the MSH Paris-Saclay. This event, under the aegis of the Institut Mines-Télécom, was organized by the DiPLab team with the support of ANR, MSH Paris-Saclay and the Ministry of Economy and Finance.

PROGRAM
9:00 – 9:15 Welcome session

9:15 – 10:40 – Session 1 – Maxime Cornet & Clément Le Ludec (IP Paris, ANR HUSH Project): Unraveling the AI Production Process: How French Startups Externalise Data Work to Madagascar. Discussant: Mohammad Amir Anwar (U. of Edinburgh)

10:45 – 11:00 Coffee Break

11:00 – 12:30 – Session 2 – Chiara Belletti and Ulrich Laitenberger (IP Paris, ANR HUSH Project): Worker Engagement and AI Work on Online Labor Markets. Discussant: Simone Vannuccini (U. of Sussex)

12:30 – 13:30 Lunch Break

13:30 – 15:00 – Session 3 – Juana-Luisa Torres-Cierpe (IP Paris, TRIA Project) & Paola Tubaro (CNRS, TRIA Project): Uninvited Protagonists: Venezuelan Platform Workers in the Global Digital Economy. Discussant: Maria de los Milagros Miceli (Weizenbaum Institut)

15:15 – 15:30 Coffee Break

15:30 – 17:00 – Session 4 – Ioana Vasilescu (CNRS, LISN, TRIA Project), Yaru Wu (U. of Caen, TRIA Project) & Lori Lamel (LISN CNRS): Socioeconomic profiles embedded in speech: modeling linguistic variation in micro-workers interviews. Discussant: Chloé Clavel (Télécom Paris, IP Paris)

Pathways in Network Science

I’m happy and honoured to speak today at the “Pathways in Network Science” online seminar of the Women in Network Science (WiNS) group.

Pathways in Network Science aims to give the stage to women and nonbinary researchers in network science to share their career paths or some pertinent aspects of them. Presentations can be a summary of the research topics explored along a speaker’s career path or even an autobiographical presentation about how opportunities and challenges influenced her aspirations and impacted her career path. They can also include discussing gender-related challenges and experiences with individual strategies and/or systemic changes.

I’ll talk about myself in terms of mobility – both geographic and disciplinary – and the challenges and opportunities it represented. I’ll also talk about resilience – or how network science helped me make my dream of devoting my life to research come true. I’ll mention impact – or how to think of the place of science in society, and how network research can lead to positive change. I’ll conclude with the challenges ahead – and how they are not only scientific, but also deeply human and social.