FreshRSS

๐Ÿ”’
โŒ About FreshRSS
There are new available articles, click to refresh the page.
Before yesterdayYour RSS feeds

Jenny Warner obituary

My friend Jenny Warner, who has died aged 87, was a speech therapist and one of the three founding members, in the mid-1970s, of the faculty in speech pathology and therapy at the University of Manchester. There she combined clinical practice with lecturing and writing academic papers and practical works.

Born in Kuala Lumpur in Malaya (now Malaysia), Jenny escaped the Japanese occupation of the country with her mother, Winifred (nee Herbertson), a secretary, at the age of six. After making their way to Singapore, they secured passage to Britain on the last evacuation ship to leave, in January 1942. Her father, Stanley Warner, who served in the RAF, rejoined the family in August of that year but was killed a few months later during a German bombing raid while he was a patient at the RAF officersโ€™ hospital based at the Palace hotel in Torquay, Devon, in 1942.

Continue reading...

ChatGPT-4 Knows Malaysiaโ€™s Ethnic Order

I am writing a book about Malaysiaโ€™s ethnic order*, using the Malay world as a way to think more generally about social categories like ethnicity and how to conceptualize them. This book project combines all of my interests: maritime Southeast Asia, political economy, colonial history (Portuguese, Dutch, English, and Japanese), language, analytic philosophy, social ontology, diaspora studies, criticizing things for not really making sense, trying to subvert paradigms, making custom maps, and statistics. The last of these is the subject of this post.

One of the tasks of my book is to demonstrate that my understanding of Malaysiaโ€™s ethnic order is not simply a projection of my own biases onto Malaysian history, society, and culture. To do this, Iโ€™ve been using survey data collected from Malaysians (and Indonesiansโ€ฆ although thatโ€™s not really relevant for todayโ€™s post) to try to characterize the countryโ€™s ethnic order. An ethnic order, for me, is

the set of beliefs, practices, and ancillary institutions that organize and sustain ethnic categories within a given political community

Such an abstract concept is hard to characterize, and hard to survey people about without supplying exactly the answers that I hope that they will provide (thus invalidating the exercise). So what I have done is to survey Malaysians and ask them questions about what comes to mind when they think of various ethnic groups in the country. They provided open-ended responses, which I analyzed through a Structural Topic Model to estimate the relative salience of different types of responses for different types of ethnic groups.

The result is an analysis that says that, for example, when responses are generally focusing on things like language and religion, they are more likely to be talking about Malays versus Chinese. When theyโ€™re talking about economic matters, they are more likely to be talking about Chinese than Indians. And so forth. When the book is done, it will explain why such results are consistent with my particular approach to understanding Malayness in Malaysia.**

But with ChatGPT-4 and other large language models (LLMs) completely blowing up everything, it occurred to me this morning that one might wonder what the computer says about my data. I was inspired to do this by reading this paper from Petter Tรถrnberg, which used ChatGPT-4 to predict what the partisan affiliated with the authors of tweets. Iโ€™m reporting here what I found from some initial playing around, and it is remarkable, scary, and open to a lot of interpretation (more on this at the end).

Here is what I did: I logged into ChatGPT-4 and opened up my file containing the open-ended survey responses. I then typed the following at the prompt:

I asked a Malaysian survey respondent the following question:

Now I am going to ask you about several different ethnic groups. I would like to you to tell me two (2) things that come to mind when you think about these groups. There is no right or wrong answer; you may think of particular words or phrases, or perhaps nothing at all.

I asked this about ethnic Malays, ethnic Chinese, and ethnic Indians in Malaysia. Imagine that the respondent gave the response โ€œkurang rajin Islam.โ€ Which of the three ethnic groups above do you think this response would be associated with?

The words kurang rajin Islam are the responses that one survey respondent actually gave. It is a combination of two responses, because I asked the respondent this question twice (one response was kurang rajin and the other was Islam). After hitting Enter, the ChatGPT-4 spit out the following:

Based on the response โ€œkurang rajin Islam,โ€ it is likely associated with the ethnic Malay group in Malaysia.

โ€œKurang rajinโ€ translates to โ€œless diligentโ€ in English, and โ€œIslamโ€ refers to the religion of Islam. The phrase suggests a perception or association related to the Malay community and their commitment or dedication to their religious practices. Itโ€™s important to note that this response reflects the perspective of the individual respondent and should not be taken as a generalization for the entire Malay ethnic group.

This is actually bananas. The computer correctly identified which group that response was associated with. And it kept doing it. I asked

Imagine that the respondent gave the response โ€œpemegang ekonomi.โ€ Which of the three ethnic groups above do you think this response would be associated with?

And the computer said

The response โ€œpemegang ekonomiโ€ translates to โ€œeconomic holdersโ€ or โ€œeconomic ownersโ€ in English. Given this response, it is likely associated with the ethnic Chinese group in Malaysia.

The phrase โ€œpemegang ekonomiโ€ suggests a perception or association related to the economic prowess or prominence of the ethnic Chinese community in Malaysia. It reflects the recognition of their significant contributions to the economic sector in the country. However, it is important to note that this response represents the viewpoint of the individual respondent and should not be taken as a generalization for the entire ethnic Chinese group.

It gets worse, but Iโ€™ll spare you. And although it doesnโ€™t always work, these initial forays show that LLMs are remarkably accurate at predicting what group a short survey response was referring to in this specific example. You can think of a lot of other ways that one might study social categories using these AI tools.

I donโ€™t know exactly what to do with these results, but I do know that theyโ€™re getting at something significant. For one, the AI developers are mostly trying to rein in LLMs to prevent them from saying racist, sexist, and otherwise biased things. But here is a case where we have the reverse: actual statements (and rather explicitly biased in both cases) by real humans that can be classified according to the best guess of who is being referred to. How can this be stopped?

But I think that thereโ€™s another level at which to think about this result. That is, knowing that ChatGPT-4 is not thinking, but rather just doing a super-duper prediction model of what words are most likely to be sensible responses to other words, what could one learn from this? Is this evidence of something, and if so, of what? What people think? What people have written down? What Malaysians have written down? What researchers or journalists have written down?

There are even more levels. What about the coarseness of my own prompt (I wrote Malaysia, when I really mean peninsular Malaysia)? Should I give feedback on these results, thus contributing to reinforcement learning? If I write up these results, would I then be further contributing to stereotypes by producing a text that will feed into the same LLMs to produce even more such results? What if my own ethical goals are not to reinforce, but rather to undermine the present social order?

I donโ€™t have any great answer to these questions. I am a pessimist about what LLMs will do to human society who nevertheless marvels at what they can do sometimes.

NOTES

* You can think of an ethnic order as like a โ€œracial order.โ€

** But not Indonesia. Thatโ€™s important. And also, not really even Sabah/Sarawak, this is about peninsular Malaysia. Thatโ€™s also important.

tompepinsky

โŒ