Authorship and Source Integrity Challenges for Future Historians in the Age of AI

Richard Toye

Author

•

min read

August 12, 2024

Share this article on:

The Rise of AI-Generated Content

The advent of AI-generated content presents a number of challenges for historians. The current focus is often on student work and the concern that tools like ChatGPT may be used to generate essays, leading to questions about whether the credit given is deserved. However, there is another side to this issue: how will future historians treat AI-generated sources? Will it be possible to detect if a piece of work was created by AI or influenced by machine learning methods used in historical research?

There are certainly a number of tools that claim to detect AI involvement in the production of texts. Yet, it is difficult to be certain of their accuracy unless one understands the underlying algorithm—a level of forensic work that very few historians are likely to undertake. Even if the algorithm is accessible, many are housed inside so-called "black boxes," where their creators keep them secret for commercial reasons.

AI in Historical Research: From OCR to Handwriting Recognition

Will historians ever have retrospective access to these algorithms? It is possible in some cases, especially if programmers choose to make them freely available as open-source software. Nonetheless, the history of algorithms, let alone the history of the documents they have been used to produce, will be a very challenging field to study. In fact, we should recognize how far we already rely on various forms of AI in historical research. For example, many digitized documents depend on optical character recognition (OCR) or even AI-driven handwriting recognition, which increasingly interact with natural language processing, text mining, and semantic analysis of historical texts.

These technologies are now intrinsic parts of our research process, even though we know that OCR can often be unreliable. While we might acknowledge this in our research findings, there is no definitive way to overcome the issue, although the technology continues to improve.

The Future of Authorship: Navigating AI-Influenced Texts

Another issue arises with text that one might consider "ordinary." Even these may have elements of AI prediction built into them. For instance, when typing a sentence, many word processing programs suggest what might come next. If you accept that suggestion, are you still the original author of the sentence? The same could be said of predictive text on mobile phones. Of course, the word suggested might be the one you were going to use anyway, but who can really say? Moreover, there is currently no way to retrospectively analyze documents—unless specifically preserved for this purpose—to determine what was human-generated and what was computer-generated or shaped by artificial intelligence in history workflows.

Preserving the Integrity of Historical Sources in an AI-Driven World

None of this is intended to discourage the use of AI, which is becoming increasingly ubiquitous and indeed unavoidable. However, authorship will be much more complicated to discern in the future, particularly when the traditional giveaways, such as handwriting or specific typewriter models, are no longer present to provide clues. Perhaps what historians need to do is to explore whether there are ways to identify authorship or at least find clues to authorship, bearing in mind that concerns over these questions are not new, especially when dealing with ancient documents of anonymous origin or materials processed through digital humanities tools and historical data analysis techniques.

‍

Richard Toye

Professor of Modern History, University of Exeter

Meet on:

Don't miss out on the latest news!

Oops! Something went wrong while submitting the form.

Contribute to Historica's blog!

Learn guidelines, requirements, and join our history-loving community.

Become an author

FAQs

How can I contribute to or collaborate with the Historica project?

If you're interested in contributing to or collaborating with Historica, you can use the contact form on the Historica website to express your interest and detail how you would like to be involved. The Historica team will then be able to guide you through the process.

What role does Historica play in the promotion of culture?

Historica acts as a platform for promoting cultural objects and events by local communities. It presents these in great detail, from previously inaccessible perspectives, and in fresh contexts.

How does Historica support educational endeavors?

Historica serves as a powerful tool for research and education. It can be used in school curricula, scientific projects, educational software development, and the organization of educational events.

What benefits does Historica offer to local cultural entities and events?

Historica provides a global platform for local communities and cultural events to display their cultural artifacts and historical events. It offers detailed presentations from unique perspectives and in fresh contexts.

Can you give a brief overview of Historica?

Historica is an initiative that uses artificial intelligence to build a digital map of human history. It combines different data types to portray the progression of civilization from its inception to the present day.

What is the meaning of Historica's principles?

The principles of Historica represent its methodological, organizational, and technological foundations: Methodological principle of interdisciplinarity: This principle involves integrating knowledge from various fields to provide a comprehensive and scientifically grounded view of history. Organizational principle of decentralization: This principle encourages open collaboration from a global community, allowing everyone to contribute to the digital depiction of human history. Technological principle of reliance on AI: This principle focuses on extensively using AI to handle large data sets, reconcile different scientific domains, and continuously enrich the historical model.

Who are the intended users of Historica?

Historica is beneficial to a diverse range of users. In academia, it's valuable for educators, students, and policymakers. Culturally, it aids workers in museums, heritage conservation, tourism, and cultural event organization. For recreational purposes, it serves gamers, history enthusiasts, authors, and participants in historical reenactments.

How does Historica use artificial intelligence?

Historica uses AI to process and manage vast amounts of data from various scientific fields. This technology allows for the constant addition of new facts to the historical model and aids in resolving disagreements and contradictions in interpretation across different scientific fields.

Can anyone participate in the Historica project?

Yes, Historica encourages wide-ranging collaboration. Scholars, researchers, AI specialists, bloggers and all history enthusiasts are all welcome to contribute to the project.

Authorship and Source Integrity Challenges for Future Historians in the Age of AI

The Rise of AI-Generated Content

AI in Historical Research: From OCR to Handwriting Recognition

The Future of Authorship: Navigating AI-Influenced Texts

Preserving the Integrity of Historical Sources in an AI-Driven World

People also read

AI and the Future of the Museum Experience

Teaching Machines Empathy: The Human Lessons in War Literature

When AI ‘Fictions’ Redirect History: Generative Models, Historiography and Misinformation

Contribute to Historica's blog!

FAQs