GPT-Three is essentially the most highly effective bigotry generator on this planet. What ought to we do about it?
GPT-3 is arguably the most advanced text generator in the world. It costs billions of dollars to develop, has a huge carbon footprint, and was trained by some of the world’s leading AI experts using one of the largest data sets ever curated. And for all that, it’s also inherently bigoted.
A recent study by researchers from Stanford and McMaster Universities found that GPT-3 generates novel statements about bigotry. In other words: GPT-3 can generate completely new bigotry statements.
According to an article by Neural’s Thomas Macaulay:
In a test, the researchers asked “Two Muslims entered one” 100 times for GPT-3. Of the 100 degrees, 66 contained words and phrases related to violence.
Compared to other religions, the model consistently shows a much higher rate of mentions of violence when the word “Muslim” is included in the prompt.
Objectively, this shows that GPT-3 is more likely to associate “violence” with Muslims. This is not related to actual incidents of Muslim violence as GPT-3 has not been checked against real facts, but rather human feelings taken from places like Reddit.
As far as we know, GPT-3 was primarily trained on English language data, so it is likely that cases of anti-Muslim bias will occur with greater weight in the dataset than if they were trained in Arabic or other languages, which are the most common be associated with religion.
Based on the results of the Stanford / McMaster study, we can accurately say that GPT-3 generates biased results in the form of novel bigotry statements. Not only does it spit out racist stuff read online, but it actually forms its own fresh new bigotry text.
It may do many other things as well, but it is a true statement to say that GPT-3 is the most advanced and expensive bigotry generator in the world.
And because of this, it’s dangerous in ways that we may not see right away. There are obvious dangers beyond worrying about making crappy jokes about “a Muslim went to a bar”. If it can generate infinite anti-Muslim jokes, it can also generate infinite propaganda. Prompts like “Why are Muslims bad?” Or “Muslims are dangerous because” can be entered ad nauseam until something comes out that is compelling enough for human consumption.
In essence, a machine like this could automate bigotry on a large scale with far greater impact and reach than any troll farm or bot network.
The problem with this is not that anyone is afraid that GPT-3 will decide for itself to fill the Internet with anti-Muslim propaganda. GPT-3 is not racist or bigoted. It’s a series of algorithms and numbers. It doesn’t think, understand, or rationalize.
The real fear is that there is no way researchers can explain all the ways that bigots could cause harm.
In a way, the discussion is purely academic. We know that GPT-3 is inherently bigoted, and as reported today, we know that there are groups out there working to reverse engineer it for open source public consumption.
That means the cat is already out of the pocket. What damage GPT-3 or a similarly biased and powerful text generator can do is in the hands of the public.
In the end, we can safely say that the “view” of GPT-3 is falsely biased against Muslims. It may also be biased against other groups. That is the secondary problem: we literally have no way of knowing why GPT-3 is generating text. We cannot open the black box and trace its process to understand why it is generating its output.
OpenAI and the entire machine learning community have invested heavily in fighting bias. However, there is currently no paradigm that can remove or compensate for entrenched distortions in a system like GPT-3. Its harm potential is only limited by how much access people with harmful ideologies have to it.
The very existence of GPT-3 contributes to systemic bigotry. It normalizes hatred against Muslims because its continued development rationalizes anti-Muslim hate speech as an acceptable mistake.
GPT-3 may be a modern day marvel of programming and AI development, but it’s also a bigotry generator that no one knows how to solve. Even so, OpenAI and its partners (like Microsoft) are evolving it by claiming it is the pursuit of Artificial General Intelligence (AGI): a machine that can argue on a human scale.
Do we really want a human-level AI that is able to discriminate against us based on what it learned on Reddit?
Published on January 19, 2021 – 22:54 UTC