April | 2023 | Status-Q

Categories:

Tags:

Sign of the times: might ChatGPT re-invigorate GPG?

It’s important to keep finding errors in LLM systems like ChatGPT, to remind us that, however eloquent they may be, they actually have very little knowledge of the real world.

A few days ago, I asked ChatGPT to describe the range of blog posts available on Status-Q. As part of the response it told me that ‘the website “statusq.org” was founded in 2017 by journalist and author Ben Hammersley.’ Now, Ben is a splendid fellow, but he’s not me. And this blog has been going a lot longer than that!

I corrected the date and the author, and it apologised. (It seems to be doing that a lot recently.) I asked if it learned when people corrected it, and it said yes. I then asked it my original question again, and it got the author right this time.

Later that afternoon, it told me that StatusQ.org was the the personal website of Neil Lawrence.

Unknown

Neil is also a friend, so I forwarded it to him, complaining of identity theft!

A couple of days later, my friend Nicholas asked a similar question and was informed that “based on publicly available information, I can tell you that Status-Q is the personal blog of Simon Wardley”. Where is this publicly-available information, I’d like to know!

The moral of the story is not to believe anything you read on the Net, especially if you suspect some kind of AI system may be involved. Don’t necessarily assume that they’re a tool to make us smarter!

When the web breaks, how will we fix it?

So I was thinking about the whole question of attribution, and ownership of content, when I came across this post, which was written by Fred Wilson way back in the distant AI past (ie. in December). An excerpt:

I attended a dinner this past week with USV portfolio founders and one who works in education told us that ChatGPT has effectively ended the essay as a way for teachers to assess student progress. It will be easier for a student to prompt ChatGPT to write the essay than to write it themselves.

It is not just language models that are making huge advances. AIs can produce incredible audio and video as well. I am certain that an AI can produce a podcast or video of me saying something I did not say and would not say. I haven’t seen it yet, but it is inevitable.

So what do we do about this world we are living in where content can be created by machines and ascribed to us?

His solution: we need to sign things cryptographically.

Now this is something that geeks have been able to do for a long time. You can take a chunk of text (or any data) and produce a signature using a secret key to which only you have access. If I take the start of this post: the plain text version of everything starting from “It’s important” at the top down to “sign things cryptographically.” in the above paragraph, I can sign it using my GPG private key. This produces a signature which looks like this:

If you were so inclined, you could easily find my corresponding public key online and use it to verify that signature. What would that tell you?

Well, it would say that I have definitely asserted something about the above text: in this case, I’m asserting that I wrote it. It wouldn’t tell you whether that was true, but it would tell you two things:

It was definitely me making the assertion, because nobody else could produce that signature. This is partly because nobody else has access to my private key file, and even if they did, using it also requires a password that only I know. So they couldn’t produce that signature without me. It’s way, way harder than faking my handwritten signature.
I definitely had access to that bit of text when I did so, because the signature is generated from it. This is another big improvement on a handwritten signature: if I sign page 6 of a contract and you then go and attach that signature page to a completely new set of pages 1-5, who is to know? Here, the signature is tied to the thing it’s signing.

Now, I could take any bit of text that ChatGPT (or William Shakespeare) had written and sign it too, so this doesn’t actually prove that I wrote it.

But the key thing is that you can’t do it the other way around: somebody using an AI system could produce a blog post, or a video or audio file which claims to be created by me, but they could never assert that convincingly using a digital signature without my cooperation. And I wouldn’t sign it. (Unless it was really good, of course.)

Gordon Brander goes into this idea in more detail in a post entitled “LLMs break the internet. Signing everything fixes it.” The gist is that if I always signed all of my blog posts, then you could at least treat with suspicion anything that claimed to be by me but wasn’t signed. And that soon, we’ll need to do this in order to separate human-generated content from machine-generated.

A tipping point?

This digital signature technology has been around for decades, and is the behind-the-scenes core of many technologies we all use. But it’s never been widely, consciously adopted by ordinary computer users. Enthusiasts have been using it to sign their email messages since the last millennium… but I know few people who do that, outside the confines of security research groups and similar organisations. For most of us, the tools introduce just a little bit too much friction for the perceived benefits.

But digital identities are quickly becoming more widespread: Estonia has long been way ahead of the curve on this, and other countries are following along. State-wide public key directories may eventually take us to the point where it becomes a matter of course for us automatically to sign everything we create or approve.

At which point, perhaps I’ll be able to confound those of my friends and colleagues who, according to ChatGPT, keep wanting to pinch the credit for my blog.

qsf

13 Apr 2023

0 Comments

Categories:

Electric Vehicles

Tags:

social networks

Tweetback

For all its problems over the years, one thing that Twitter has always done quite well is to give you access to backups of your data. You can request a dump of your archive and (a day or two later) download a ZIP file containing all of your Tweets and lots of other data besides, which you could process with your own code, but can also view using an HTML page included in the archive.

As the future of Twitter gets ever more uncertain, I ask for backups more frequently! You might want to do the same… one day you may not be able to.

I had an account in the very early days, and then deleted it, before joining again a little while later. So the first tweet on my current account, just over 15 years ago, says:

I didn’t have an iPhone back then – it existed, but I waited for the 3G version, which arrived a few months later. So quite a few of my early Tweets are about the fun things I could do with my iPod Touch, though I guess most of them would actually have been submitted using SMS text messaging, as you did back then.

I was also amused to see that one of my earliest messages was while barbecuing in the garden. I was recommending a local brand of sausage, which I can still recommend as one of my favourites: “Musk’s“.

That name, but attached to another owner, was to pop up on Twitter occasionally in the future.

I had actually met Elon Musk a couple of years before, but little did I know that his name would one day be so closely associated with this new social network. Back then, he was mostly known for PayPal and SpaceX, though he did have an intriguing side-interest in electric cars. It didn’t look likely that I (or most of my friends) would ever be able to afford one, though, so his other activities were of more interest.

These days, I still consume that fine brand of Newmarket sausages with the same frequency and enthusiasm as I did back then. Other things have changed, however. I drive an electric car every day, and have barely been in a petrol station for 7+ years. Twitter, on the other hand, I only look at a couple of times per week. Elon Musk is primarily responsible for both of those changes!

qsf

11 Apr 2023

0 Comments

Categories:

General

Perhaps Methuselah didn’t live 900 years after all?

On Sunday I had a video call with an old family friend, Marjorie, who has just celebrated her 103rd birthday, and is doing well. To put that in context, she was a teenager when the king acceded to the throne. No, not the current king. His grandfather. You know, the one whose daughter reigned for three score years and ten after him.

So I had longevity in mind when I saw Charles Arthur’s link to a rather nice study by S. J. Newman from the Australian National University, looking at the data about other people who have lived beyond 100 years. The records of supercentenarians tend to cluster in particular geographic areas, and many reasons have been suggested for this.

As the paper says,

The observation of individuals attaining remarkable ages, and their concentration into geographic sub-regions or ‘blue zones’, has generated considerable scientific interest. Proposed drivers of remarkable longevity include high vegetable intake, strong social connections, and genetic markers. Here, we reveal new predictors of remarkable longevity and ‘supercentenarian’ status.

It’s a nice read, and a good lesson in why those involved in data analysis sometimes need to dig a bit deeper.

The quick summary is that areas reporting large numbers of supercentenarians not only have high degrees of poverty, social interaction, and high vegetable intake. They are also areas where reliable record-keeping was only recently introduced, or where there may be other reasons for assuming that reports of very old people are not entirely accurate.

Some of this can be deduced by looking at the data directly, for example the fact that “supercentenarian birthdates are concentrated on the first of the month and days divisible by five” or that some areas with numerous residents in their 100s seem to have surprisingly few residents in their 90s.

Sometimes there are other reasons for doubting the reliablility of the data:

The large-scale US bombing and invasion of Okinawa involved the destruction of entire cities and towns, obliterating around 90% of the Koseki birth and death records with almost universal losses outside of Miyako and the Yaeyama archipelago. Post-war Okinawans subsequently requested replacement documents, using dates recalled from memory in different calendars, from a US-led military government that largely spoke no Japanese.

Sometimes, past researchers have had to use their own judgement when assessing individual cases:

Individual case studies often highlight the role of personal judgement, and the potential for both conscious and unconscious bias, during age validation. For example, Jiroemon Kimura, the world’s oldest man, is widely considered to be a valid supercentenarian case. However, Kimura has at least three wedding dates to the same wife, has three dates of graduation from the same school, was conscripted to the same military three times in four years despite the mandatory conscription period being three years long, and has at least two birthdays.

Then there are the financial issues to consider:

In 1997 Italy discovered it was paying 30,000 pensions to dead people

and

A subsequent 2012 investigation by the Greek labor ministry was triggered by the “unusually high number of 9,000 Greek centenarians drawing old-age benefits”, a notable figure given the 2011 Greek census found only 2,488 living centenarians.

Yes, the sad conclusion is that:

Like the ‘blue zone’ islands of Sardinia and Ikaria, Okinawa represents deprived regions of rich, high-welfare states. These regions may have higher social connections, and arguably may have had higher vegetable intakes in the past, but they also rank amongst the least educated, poorest, highest-crime and least healthy regions of their respective countries.

Lastly, you know those reports that the longest-lived are often people whose lives are characterised by smoking, drinking and unhealthy lifestyles? The paper has some wry comments about that too:

It is unclear why clinically excessive drinkers or daily smokers should survive at equal or higher rates, and increase in population frequency at extreme ages, unless these lifestyle factors are positively correlated with committing fraud or having an incorrect age.

And, referring again to the decisions of the researchers who build the databases:

A reliance on this type of opinion, where qualitative judgements are employed to shape public perceptions of authenticity, seems to be widely considered satisfactory. This seems particularly the case when explaining the otherwise anomalous health habits of supercentenarians. For example, Maier et al. issued a contradictory statement that Jeanne Calment smoked both one and two cigarettes a day for an entire century, followed by the justification that this counter-indication of health could be explained because she “possibly did not inhale at all”.

It was likewise observed that, from age 20 to age 117, the then-oldest man in the world, Christian Mortensen, smoked “mainly a pipe and later on cigars, but almost never cigarettes… he had also chewed tobacco…but never inhaled”.

Why two people would voluntarily choose to smoke for an accumulated 190 years, yet never inhale, was never explained.

Status-Q

Quentin Stafford-Fraser's blog

Monthly Archives: April, 2023

Sign of the times: might ChatGPT re-invigorate GPG?

Perhaps Methuselah didn’t live 900 years after all?

Zappi Days

Feeds

Recent Posts

Recent Comments

Categories

About Quentin

Links

Meta

About Quentin

Links