Artificial intelligence – Data Big and Small

Chile and the materiality of AI

After having studied the impacts of AI on labour and society for years, I am now joining forces with colleagues interested in the materiality of AI and its appetite for natural resources – energy, water, and land. As part of the collaborative project SEED, with the DiPLab and FAIR teams, I spent almost a month doing fieldwork in Chile to dig deeper into the matter.

Google’s data center in Quilicura, Chile.

Chile is an especially interesting case study. In the global industry of AI, it is a peripherical country; but it is a rich country, highly digitized and with the best connectivity in the world, where use of AI tools is already widespread. It is not a leader, but it is a hub that provides resources. In Chile, all the phases of the supply chain of AI can be observed, starting from the mines of rare earths in Penco, and of copper and lithium in the Atacama Desert, to the data centers that populate (especially, though not only) the Quilicura area, the submarine cables that connect Valparaiso to the world through the Pacific, and the research labs in Santiago. Chile is also emblematic of the frictions that may arise as a result, and of the potential societal reactions – the most telling example being the quili.ai initiative to raise worldwide awareness about the consumption of water by data centers and the ensuing risks for the population. Yet, not all the potentially concerned actors have sufficient capacity to anticipate these impacts, especially outside the capital.

After an interdisciplinary panel at the intersection of AI, mineral extraction, ecology, and planetary habitability at Museo Chileno de Arte Precolombino, Santiago, 2 July 2026.

During almost a month, I could address these issues very productively with fellow researchers, policymakers, environmental judges, activists, and artists. The result is a rich baggage of information, and an extremely helpful basis for further research and action.

The dual footprint of AI, in the green-digital transition

I am delighted to be part of a broad reflection, initiated and promoted by Edemilson Paraná, that critically interrogates the convergence of environmental sustainability and digitalization, in a special issue just published in the journal Globalizations. Governments and multi-lateral institutions frame the “twin transition” as a strategy to foster economic growth and durable social prosperity at the same time. However, Paraná argues in his introductory article (co-authored with Rodrigo Santaella-Gonçalves), this agenda is shaped by market logics and geopolitical competition, in ways that ultimately contradict the social and ecological goals it supposedly supports. The special issue has been designed to offer a systemic political-economy perspective on this inconsistency, grounded in critical social sciences and heterodox economics. Overall, it shows how the structural drivers of digitalisation can constrain its green potential and identifies pathways for redirecting the transition towards socially equitable and ecologically sustainable transformations.

Ulysse Gerkens & FARI / “Server Pool” / https://betterimagesofai.org / https://creativecommons.org/licenses/by/4.0/ This image subverts the swimming pool of the Villa Empain, a Brussels contemporary art venue. The pool is drained of its water and filled with computer servers. The image makes visible the massive water consumption required to cool data centers, but also questions the allocation of resources: when enormous budgets shift toward digital infrastructure, what remains for culture?

My own article within this special issue aims to reconcile two separate trends in the study of the impacts of artificial intelligence (AI), one focused on the natural and the other on the social surroundings that supply resources for its production and use. I introduce the concept of the ‘dual footprint’ as a heuristic device to capture the commonalities and interdependencies between them. Originally borrowed from ecology, the concept denotes here the total impacts on the environment and society generated by AI’s production and use. It is an indicator of sustainability insofar as it grasps the degree to which the AI industry is failing to ensure the maintenance of the socio-economic systems and environmental conditions necessary to its production. To develop the concept in this way, it is necessary to (provisionally) renounce some of the accounting flavour of extant footprint measures, allowing for a more descriptive interpretation. In my article, the dual footprint serves as a mapping tool, linking impacts to specific locations and to the people and groups that inhabit them.

I use two in-depth case studies, each illustrating international flows of raw materials (nature) and of labour services (society). Case studies are a preliminary illustration of a broader idea, that will need to be developed further: that the AI industry is a value chain that spans national boundaries and perpetuates inherited global inequalities. Specifically, I use the case of Argentina as a provider to the United States and Madagascar as a provider to France, Japan and South Korea. It appears that the countries that drive AI development (here, the United States, France, Japan and South Korea) generate a massive demand for inputs and trigger social costs that, through the value chain, largely fall on more peripheral actors (here, Argentina and Madagascar). While Argentina and Madagascar differ under many respects, in both cases initial hopes of jobs and prosperity have failed to materialise as predominant outsourcing arrangements reinforce informality and precariousness without preventing environmental damage. Put differently, the arrangements in place distribute the costs and benefits of AI unequally, resulting in unsustainable practices and preventing the upward mobility of relatively more disadvantaged countries. The concept of the dual footprint is useful because it grasps how the environmental and social dimensions of the dual footprint emanate from similar underlying socio-economic processes and geographical trajectories.

A prior presentation of this idea can be found here.

A pre-print version of the article (green open access) can be found here.

INDL-9 conference on ‘AI supply chains’

The Call for Papers for the ninth conference of the International Network on Digital Labor (INDL-9) is now open!

INDL-9 will take place between 9 and 11 September 2026 and will be hosted by the International Labour Organization in beautiful Geneva, Switzerland.

INDL conferences provide a unique opportunity to share knowledge and new perspectives in research and practice related to digital labor.

The topic of this year, “AI Supply Chains,” has the ambition to build an interdisciplinary research agenda on AI and work. Submissions are particularly encouraged in the following thematic areas:

• Transparency and traceability in the AI models and their supply chains
• Working conditions, occupational safety and health of workers in the human-in-the-loop
• Best practices for ethical AI procurement and corporate social responsibility in data labelling
• Role of social dialogue in governing AI-mediated work
• Organisational, legal and financial perspectives on the rate of investment of ethical AI and challenges of regulatory compliance (eg., the EU AI Act)
• New frameworks for a “human-centric” AI supply chain
• Ecological impacts and environment sustainability of AI infrastructures

Proposals are also invited in topic areas that previously garnered substantial interest from conference presenters:

• Algorithmic management, labor control, and workers’ resistance
• Platform cooperativism and alternative business models
• Legal frameworks, regulatory initiatives, and institutional responses to platform labor
• Gender and digital labor

This edition of the INDL-9 conference is organised through a collaborative partnership between the ILO (International Labour Organization), the TASC Platform of the Geneva Graduate Institute’s Centre for Trade and Economic Integration, DiPLab (Digital Platform Labor), ACM SIGCAS (the Association for Computing Machinery Special Interest Group on Computers and Society), and Yale University.

Abstracts can be submitted on or before 7 May 2026 here.

What does AI do to Job Quality? Platformisation, Fissured Workplaces and Dispersion

Contemporary debates on AI and labour often focus on quantitative impacts, such as potential job losses. Yet, the qualitative question of how AI reshapes the experience of work is equally critical. A recent open-access book chapter that I co-authored with Antonio A. Casilli examines these transformations, highlighting their already tangible, if not yet final, effects on job quality.

New book: *Job Quality in a Turbulent Era*, edited by A. Piasna & J. Leschke, 2026, Edward Elgar (open access)

The analysis starts with AI deployment, that is, the introduction of AI solutions into the workplace – for example, use of automated transcription tools. It is typically framed as a productivity enhancer that can improve job quality by making work more engaging – and in some cases, even safer. However, critics argue that firms often use AI to intensify labour through tighter managerial control and surveillance. In this scenario, AI may degrade job quality by restricting autonomy, reinforcing algorithmic management, and limiting workers’ ability to negotiate their roles or pace.

But before AI tools reach the workplace, they must be designed, built, and marketed. Today’s AI production relies heavily on machine learning, which requires vast datasets. This process involves extensive “data work,” performed by myriad lower-level workers in three key functions: preparation (data generation and annotation), verification (quality control of AI outputs), and impersonation (manually performing tasks meant to be automated). While this process increases labour demand the quality of data work is often poor, with low wages, job insecurity, and misrecognition of skills. The root of these issues lies in labour platformization.

The Preparation – Verification – Impersonation functions of data work in AI production. Source: Tubaro et al. 2020.

To see this, let’s first take a step back and consider that most data work is procured through outsourcing (shifting lower-value functions outside the firm) and offshoring (relocating work to lower-wage countries). These practices exclude externalised workers from the resources of lead firms, creating a “planetary market” where AI companies in the US and Europe source labour from countries like Venezuela or Madagascar. Digital platforms amplify these trends, enabling on-demand outsourcing to precarious micro-providers globally. Platformization thus exacerbates the negative effects of outsourcing and offshoring, degrading job quality.

Platformisation is best understood at the crossroads of two broader socioeconomic tendencies. First, the “fissured workplace“, where companies market goods or services without employing all those who contribute to their design, assembling or delivery. This model heightens insecurity, as metrics and surveillance systems erode autonomy and increase stress. Offshore platform workers, in particular, face isolation, losing traditional workplace camaraderie and support. Second, “dispersion“, the constant need to manage interruptions, multitask, and prioritise competing demands. AI-driven tools, such as real-time scheduling algorithms and productivity trackers, intensify dispersion, blurring boundaries between professional and personal life and increasing mental strain.

Overall, then, the effects of AI on job quality are mixed. While AI deployment can enhance productivity and safety, it often intensifies labour and undermines autonomy. Meanwhile, AI production creates jobs, but these are frequently precarious and low-quality. Understanding these dynamics is essential for shaping equitable futures that leverage the potential of contemporary technologies while safeguarding labour rights and well-being.

A detailed digital collage inspired by the aesthetic of medieval manuscript illustrations, depicts a vibrant scene of construction workers building a structure. The traditional elements, such as figures in traditional Persian gowns, wooden ladders, ropes, and woven sacks, are interwoven with modern technological motifs like circuit boards, QR codes, and cloud icons. Bright, glowing, golden networks of interconnected nodes and symbols of digital technology overlay the scene, blending the past and present. On the left, Arabic calligraphy is imprinted onto the paper. The composition uses contrasting textures and colours to juxtapose and interweave the themes and eras. — Shady Sharify / Who is AI Made Of / https://betterimagesofai.org / https://creativecommons.org/licenses/by/4.0/

This is not a story of technological determinism. AI alone does not destroy jobs or degrade their quality; rather, its impacts stem from the broader economic forces that shape its presence and role in today’s economies. The above artwork captures this idea by foregrounding humanity’s collective endeavour in building artificial intelligence, drawing inspiration from Persian miniature painting. In sum, the future of work in the age of AI will hinge on political choices about production organisation and the distribution of its benefits and costs.

The book chapter can be found as: Casilli, A.A. & Tubaro, P. 2026. “What is AI Doing to Job Quality? Platformization, Fissured Workplaces and Dispersion”. Chapter 3 in A. Piasna & J. Leschke (eds.) Job Quality in a Turbulent Era. Cheltenham, UK: Edward Elgar Publishing, pp. 45-60, https://doi.org/10.4337/9781035343485.00009

A video of the launching event of the book – including my presentation of the chapter – is available here.

Outsourcing pain: the hidden health and safety costs of AI production

The video of my presentation at the 2025 annual conference of the European Trade Union Institute (ETUI) on Occupational Safety and Health (OSH), is now out! This year, the conference was dedicated to the age of artificial intelligence.

In my presentation, I challenged common narratives of automation by revealing how, behind the “magic” of contemporary AI, lies intensive human labour. This includes the often-invisible work of data annotators, content moderators, translators, voice actors, and numerous other contributors who make AI systems function. OSH issues in AI production arise directly from the organization of this work. The combination of outsourcing, offshoring, and digital intermediation creates precarious labor conditions that significantly affect workers’ mental health and well-being. I highlighted three critical dimensions of occupational health risks:

Stress from uncertainty and long/unusual working hours: Data workers face unstable employment conditions, irregular schedules, and the constant pressure of uncertain income streams.
Social isolation: The digitally mediated nature of this work, often performed remotely and with little direct human contact, contributes to profound feelings of isolation among workers.
Post-Traumatic Stress Disorder (PTSD): Content moderation workers, in particular, face severe psychological consequences from repeated exposure to disturbing, violent, or traumatic content.

The ETUI annual conference on Occupational Safety and Health brought together researchers, trade union representatives, policymakers, and practitioners to examine the challenges and opportunities that artificial intelligence presents for workplace safety and health across Europe.

See here for more information.

Data work in Egypt: Who builds AI behind the scenes?

We know this now: artificial intelligence is not only a Silicon Valley product. When trying to look further, research and the media have found AI’s “hidden workforce” (the data workers who label images, transcribe audio, and evaluate content to train machine learning models) in countries like Kenya, the Philippines, Venezuela, and Madagascar.

In a new study, led by Myriam Raymond and with the collaboration of Antonio A. Casilli and Lucy Neveux, we lift the veil on data work in Egypt. Over 600 questionnaires, 15 focus groups, and an online ethnography reveal the substantial contribution of this country to AI technologies produced and marketed overseas. Egypt’s position in the global AI supply chain is unique, as it serves technology companies both in the Western world (Europe and North America) and in China, often through intermediaries based in the Gulf.

As already observed in other countries, these workers are mostly young: three quarters are below 34 years of age. They live mostly in urban areas. They are also highly educated: in particular, three out of five have an undergraduate degree in science or technical fields. Another notable similarity concerns low pay and lack of protections. We find that four out of five data workers undertake this activity out of financial need, and they spend the income earned in this way immediately on rent, food, and clothes. On average, though, data work pays less than half the country’s monthly minimum wage, and earnings are highly volatile.

The gender gap is more acute than observed elsewhere. Three workers out of four are men. The few female data workers are more dependent on this activity: data work is the only job for two out of ten of them (against one in ten men). Women face unique barriers, reflecting locally-grounded cultural constraints and concerns about online safety.

More generally, culture and morals play an important role in the perceptions that Egyptian workers have of their activity – in ways that had not emerged so forcefully in other countries before. Data tasks sometimes conflict with their principles and beliefs, prompting them to continually question and, at times, reshape their digital identities.

Read the full report here.

Organising AI data workers: barriers, alternatives, and ways forward

The work that fuels AI – from data labelling to content moderation, output checking, red teaming, and so on – is typically outsourced. Digital platforms that operate as online marketplaces play a pivotal role in making this possible. They extend outsourcing to individuals, removing the informational bottlenecks that previously limited it to (multi-person) firms. Platforms treat workers as independent contractors and do not guarantee labour rights. Job insecurity, income volatility, wage theft, and in some cases mental health issues, are common. However, cases of worker mobilisation remain rare. Why, and how can this be changed?

Barriers

A specific challenge that arises on platforms is the asymmetric distribution of work, with a relatively small number of users doing most tasks, and a long tail of minimally-active (or even inactive) people. The reason is that registration is (more or less) open but demand is variable, so that a worker must beat the competition to find tasks to do. This has two main implications. One is that from start, there is an incentive to see other workers as competitors rather than colleagues. The other is that it is difficult to motivate people in the long tail to take action: they are more likely to exit than to voice their grievances.

Lack of a shared worker identity is another crucial gap. Data work was initially portrayed as simple and straightforward, and even sometimes considered as a form of consumption or leisure. Many platforms carefully avoid even using the word ‘worker’, instead preferring terms like ‘Turkers’ (Amazon Mechanical Turk) or ‘Tolokers’ (Toloka). The very fact that workers themselves often take the rhetoric of simple tasks at face value, and struggle to see themselves as such, is indicative of their experience of disrespect, due to widespread misrecognition.

Max Gruber / https://betterimagesofai.org / https://creativecommons.org/licenses/by/4.0/

Juliet Schor writes that “platform earners are not only independent in a legal sense; they also typically do their work independently of other workers”. Technology enables extreme fragmentation of labour and rules out teamwork. Neither do workers ever meet their clients (technology producers), due to platform intermediation. In sum, platform data work isolates workers both from their peers and from other stakeholders (as the above picture cleverly represents). How to organise if you are alone?

Alternatives

In this context, it is useful to broaden our understanding of worker organisation. Beyond collective acts undertaken within an institutionalised framework, we should also embrace informal, unorganised and subtle actions, which can nevertheless lead to positive outcomes.

In crisis-stricken Venezuela, very large numbers of people started data work on online platforms to earn much-needed hard currency. Here, workers have leveraged their personal networks of family and friends to surmount the multiple obstacles posed by platform work. They never created any official organisation, and their actions would rarely qualify as forms of resistance. Some were mere attempts to limit losses in a harshly competitive environment. As researchers, we need to be mindful of cases in which, owing to an unfavourable context, workers prioritise the (short-run) need to counter local scarcity through online earnings, rather than any (longer-run) fight against unfavourable platform management. (More on this case here.)

Ways forward

There are nevertheless signs that successful strategies exist. Kenya is a rare example of organisation: data workers and content moderators in this country initiated actions that attracted international attention and, as in a virtuous circle, support. Of course, not all is easy for them, but Kenya is now a reference, an example for everyone else. This suggests that it is essential to give visibility to workers’ conditions and to any action they undertake to defend their rights.

The other lesson learned from the Kenyan case is that collaborations between multiple stakeholders can achieve a lot in supporting workers and triggering change. Not only established unions, but also researchers, policymakers, and activists in various NGOs (for example, engaged for personal data protection, against discrimination, etc.) can act as multipliers of the resources available to workers.

American AI, made in Venezuela

The political tensions between Venezuela and the United States are at an all-time high, after news of strikes on Caracas and the alleged capture of President Maduro and his wife. The recent escalation follows years of economic sanctions and a deep divide between the two countries.

And yet, Venezuela has massively supplied cheap data work to US-based technology producers throughout all these years. Through digital labour platforms, an educated but impoverished workforce made its way to (the bottom of) the supply chains of US-directed artificial intelligence (AI).

Since about 2017, high inflation, increasing scarcity of even basic goods, and widespread poverty have pushed Venezuelans to work for international platforms that pay in US dollars, albeit at low rates. They have come to constitute a large reservoir for technology producers, mainly (though not only) in the United States. Known for their willingness to accept even the lowest pay rates in the data work market, Venezuelan workers have annotated hundreds of thousands of videos and images for the development of (for example) self-driving vehicles. Ironically, the very policies of Chavist governments – from Chávez himself to Maduro – made this possible. Cheap access to electricity and promotion of digital literacy, including through the widespread distribution of locally produced computers (‘Canaima’) to students and schoolchildren, provided people with the necessary infrastructure to perform data work. Even outdated and malfunctioning, these equipments played a crucial role in enabling widespread Venezuelan participation to the AI pipeline.

Nicolas Gourault (2020). VO: A documentary and sensory investigation about the role of human workers in the training of driverless cars. Source: https://nicolasgourault.fr/films/vo

For Venezuelan workers, platforms labour has constituted a a resilience strategy against adverse local conditions. Participation has never been easy owing to frequent power cuts, slow internet connection, and aging devices, not to mention the difficulty of working almost entirely in English. The high educational levels and computing skills of many workers (including experienced professionals and science/technology students), and embeddedness in densely knit networks of support offered solutions. At the same time, work on platforms is not without challenges, and all Venezuelan data workers have experienced some form of disrespect. Being paid less than peers in neighbouring countries, or even being offered fewer tasks than these foreign peers, are examples of this. At some point, they had to endure a widespread perception that they do not work well. Resisting against international platforms can be more challenging than bypassing local restrictions, and Venezuelan workers limited themselves to occasional acts of minor cheating involving only very few of them.

Venezuelans’ resolve to move out of the crisis and the networked relationships that sustain each of them against hardship have made them massively present, as ‘uninvited protagonists,’ in international data work platforms. Conversely, some AI companies and platforms (from the Global North in general, and from the United States specifically) targeted Venezuela deliberately, not much for the qualities and skills of its highly educated population, but for its low cost at a time of crisis. These platform-mediated encounters enabled short-term solutions, but haven’t raised Venezuela out of poverty, and haven’t ensured a durable provision of high-quality data for AI.

What comes next inside Venezuela is deeply unclear, but unfortunately, nothing (for now) suggests any recognition of the role of these workers in the technology industry, or any opportunity to reshape its outputs in more equitable and respectful ways.

Sociology of AI, Sociology with AI (2)

There are two main ways in which a discipline like sociology engages with artificial intelligence (AI) and is affected by it. In a previous post, I discussed how the sociology of AI understands technology as embedded in socio-economic systems and takes it as an object for research. Here, focus is on sociology with AI, indicating that the discipline is integrating AI into its methodological toolbox.

AI technologies were designed for other-than-research purposes, but they may be repurposed. The editors of a special issue of Sociologica last year stress that, ten years ago, digital methods presented similar challenges: for example, using tweets to make claims about the social world required us to understand how people used Twitter in the first place. We also needed to understand which people used Twitter at all, and what spaces of action the architecture and Terms of Use of this platform allowed. Likewise, using AI technologies can serve sociologists insofar as efforts are made to understand the technological (and socio-economic) conditions that produced them. Because AI systems are typically blackboxed, this requires, to begin with, developing exploration techniques to navigate them. For example, the plot below is from a recent paper that asked five different LLMs to generate religious sermons and compared the results (readability scores) by religious traditions and race. It finds that Jewish and Muslim synthetic sermons were constructed with significantly more difficult reading levels than were those for evangelical Protestants, and that Asian religious leaders were assigned more difficult texts than other groups. It is a way to uncover how models treat religious and racial groups differently, although it remains difficult to detect precisely which factors affect this result.

Robust regression coefficients predicting readability scores by religion, race, and model. Source: Tom J.C., Ferguson T.W. & Martinez B.C. 2025. Religion and racial bias in Artificial Intelligence Large Language Models. *Socius: Sociological Research for a Dynamic World*, 11. https://doi.org/10.1177/23780231251377210

That said, how can AI help us methodologically? To answer this question, it is useful to look at qualitative and quantitative approaches separately. Qualitative research, traditionally viewed and practiced as an intensely human-centred method, may seem at first sight incompatible with it. However, use of computer-assisted qualitative data analysis (with tools such as Nvivo) is now common among qualitative researchers, though it faced some degree of scepticism at the beginning. Attempts to leverage AI move forward this agenda, and the most common application so far is automated transcription of interviews through speech recognition technologies. AI-powered tools make this otherwise tedious task more efficient and scalable. A recent special issue of the International Journal of Qualitative Methods maps a variety of other usages, less common and more experimental: for example, considering that even the best models for audio transcription are not as accurate as humans, LLMs appear as tools to facilitate and speed up transcription cleaning. There are also some attempts at using LLMs as instruments for coding and thematic analysis: for example, some authors have examined inter-coder reliability between ‘human only’ and ‘human-AI’ approaches. Others have used AI image generation like vignettes – as a tool for supporting interview participants in articulating their experiences. Overall, the use of AI remains experimental and marginal in the qualitative research community. Those who have undertaken these experiments find the results encouraging, but not perfect.

Wes Cockx & Google DeepMind / AI large language models (detail) / https://betterimagesofai.org / https://creativecommons.org/licenses/by/4.0/

In quantitative research, some AI tools are already (relatively) widely used: in particular, natural language processing (NLP) to process textual data like corpora from the press or media. More recent applications leverage generative AI, especially large language models (LLM). Outside practices like literature review and code generation/debugging, which are common to multiple disciplines, three applications are specifically sociological and worth mentioning. First, in experimental research, there are some attempts to examine how the introduction of AI agents in interaction games shapes the behaviour of the humans they play with. As discussed by C. Bail in an insightful PNAS article, the extent to which generative AI is capable of impersonating humans is nevertheless subject to debate, and it will probably evolve over time. Second, L. Rossi and co-authors outline that in agent-based models (ABM), an idea is to use LLMs to create agents that are more capable of capturing a larger spectrum of human behaviours. While this approach may provide more realistic descriptions of agents, it re-ignites a long-standing debate in the field: indeed, many believe that increasing the complexity of agents is undesirable when emergent collective dynamics can emerge from more parsimonious models. It is also unclear how the performance of LLMs within ABMs should be evaluated. Shall we say that they are good if they reproduce known collective dynamics within ABMS? Or, should they be assessed based upon their capacity to predict real-world outcomes? Third, in survey research, the question has arisen whether LLMs can simulate human populations for opinion research. Some studies have tested this possibility, with mixed results. The most recent available evidence, in an article by J. Boelaert and co-authors, is that: 1) current LLMs fail to accurately predict human responses, 2) they do so in ways that are unpredictable, as they do not systematically favor specific social groups, and 3) their answers exhibit a substantially lower variance between subpopulations than what is found in real-world human data. In passing, this is evidence that so-called ‘bias’ does not necessarily operate as expected – LLM errors do not stem primarily from unbalanced training data.

These applications face specific ethical challenges. First, studies that require humans to interact with AI may expose them to offensive or inaccurate information, the so-called ‘AI hallucinations’. Second, there are new concerns about privacy and confidentiality. Most GenAI models are the private property of companies: if we use them to code in-depth interviews about a sensitive topic, the full content of these interviews may be shared with these companies, often not bound by the same standards and regulations in terms of personal data protection. Third, the environmental impact of these models is high, in terms of energy to run the system, water to cool servers in data centres, metal extraction to build devices, and production of e-waste. The literature on AI-as-object also warns that there is a cost in terms of the unprotected human work of annotators.

Another limitation is that is that research with Generative AI is difficult to replicate. These models are probabilistic in nature: even identical prompts may produce different outputs, in ways that are not well understood as of today. Also, models are constantly being fine-tuned by their producers in ways that we as users do not control. Finally, different LLMs have been found to produce substantially different results in some cases. Many of these issues are due to the proprietary nature of most models – so much so that some authors like C. Bail believe that open-source models devoted to, and controlled by, researchers can help address some of these challenges.

Overall, AI has slowly entered the toolbox of the sociologist, and except for some applications that are now commonplace (from automated transcriptions to NLP), its use has not completely revolutionised practices. This pattern is not exclusive to sociology. My own study of the diffusion of AI in science until 2021, as part of the ScientIA project led by F. Gargiulo, showed limited penetration in almost all disciplines, although the last two years have seen a major acceleration. The opportunities that AI offers are promising, although a lot are more hypothetical than real at the moment. We still see calls that invite sociologists and social scientists to embrace AI, but the number of realizations is still small. Almost all applications devote time to consider the epistemological, methodological, and substantive implications. A question that often emerges concerns the nature of bias. The AI-as-object perspective challenges the language of bias, and we see the same here, though from a different perspective. There’s still no shared definition of bias (or any substitute for this term). More generally, patterns are similar in qualitative and quantitative studies. The guest-editors of last year’s special issue of Sociologica suggest that, like digital methods 10-15 years ago, generative AI is supporting a move beyond the traditional qualitative/quantitative divide.

Concluding, both Sociology-of-AI and Sociology-with-AI exist and are important, but they are not well integrated. This is one of the bottlenecks for the development of the methodological toolbox of sociology, but also for the development of an AI that is useful and positive for people and societies. In part, this may be due to lack of adequate (technical) training for part of the profession, or to the absence of guidelines (for ethics and/or scientific integrity). But perhaps, the real obstacles are less immediately visible. One of them is the difficulty to judge the uptake of AI in our discipline: are we just feeding the hype if we use it? Or are we missing a major opportunity to make sociology more relevant/stronger if we don’t? The other concerns the questions and issues that go beyond the specificities of sociology. How to continue interacting with other disciplines, while upholding the distinctive contribution of sociology?

The dual footprint of AI

The impacts of artificial intelligence (AI) on the natural and social surroundings that supply resources for its production and use have been studied separately so far. In a new article, part of a forthcoming special issue of the journal Globalizations, I introduce the concept of the ‘dual footprint’ as a heuristic device to capture the commonalities and interdependencies between them. Originally borrowed from ecology, the concept denotes in my analysis the total impacts on the natural and social surroundings that supply the resources necessary for AI’s production and use. It is an indicator of sustainability insofar as it grasps the degree to which the AI industry is failing to ensure the maintenance of the social systems, economic structures, and environmental conditions necessary to its production. To develop the concept in this way, it is necessary to (provisionally) renounce some of the accounting flavour of extant footprint measures, allowing for a more descriptive interpretation. In my article, the dual footprint primarily serves as a mapping tool, linking impacts to specific locations and to the people and groups that inhabit them.

Gloria Mendoza / ‘The Environmental Impact of Data Centers in Vulnerable Ecosystems’ / https://betterimagesofai.org / https://creativecommons.org/licenses/by/4.0/

My analysis draws on recent research that challenges idealized narratives of AI as the sole result of mathematics and code, or as the fancied machinic replacement of human brains. The production of AI relies on global value chains which, like those of textiles and electronics, take shape within the broader context of globalization, its long-standing trends of outsourcing and offshoring, and the cross-country disparities on which it thrives.

The argument is based on two case studies, each illustrating AI-induced cross-country flows of natural resources and data labour. The first involves Argentina as a supplier to the United States, while the second includes Madagascar and its primary export destinations: Japan and South Korea for raw materials, France for data work. These two cases portray the AI landscape as an asymmetric structure, where the countries that lead the tech race generate a massive demand for imports of raw materials, components, and intermediate goods and services. Core AI producers trigger the footprint and therefore should bear responsibility for it, but the pressure on (natural and social) resources and the ensuing impacts occur predominantly elsewhere. Cross-country value chains shift the burden toward more peripheral players, obscuring the extent to which AI is material- and labour-intensive.

Flows of raw materials (mainly nickel and cobalt from the Ambatovy mining project) from Madagascar to East Asia and, to a lesser extent, Europe and North America (top); flows of data work services from Madagascar to France, followed by North America and to a lesser extent, East Easia (bottom). Madagascar, one of the poorest countries in the world, contributes to state-of-the-art AI production without managing to move up the value chain.

This drain of resources toward AI engenders adverse effects in peripheral countries. Mining notoriously generates conflicts, and data work conditions are so poor that other segments of society – from local employers to workers’ families and even informal-economy actors – must step in to cover part of the costs. The current arrangements thus fail to ensure their own sustainability over time. Additionally, the aspirations of these countries to leverage their participation to the AI value chain as a development opportunity, and to transition toward leading positions, remain unfulfilled.

The dual footprint can fruitfully dialogue with the critical literature that leverages the concepts of extractivism (for example, Cecilia Rikap‘s concept of “twin” extractivism) and dependency (as theorised for example by Jonas Valente and Rafael Grohmann). Its contribution lies mainly in the effort to operationalise the ideas of more abstract social theories, while also facilitating mutual enrichment between different literatures.

Read the full paper: subscription-protected or open-access preprint.

The paper was developed as part of an initiative on ‘The Political Economy of Green-Digital Transition‘, organised by Edemilson Paraná in 2024 at LUT University in Finland. Further, the idea that the environmental and social dimensions of AI production emanate from similar underlying socio-economic processes and geographical trajectories constitutes the foundation of SEED – Social and Environmental Effects of Data Connectivity, a new DiPLab project that investigates how data extraction and material extraction are deeply interconnected. It stems from a collaboration with Núcleo Milenio FAIR at the Pontificia Universidad Católica de Chile and compares data and material infrastructures in Europe and South America.