Aston University forensic linguistics experts partner in $11.3 million funding for authorship attribution research

Dec 7, 2022


  • Aston Institute for Forensic Linguistics (AIFL) is part of the project to infer authorship of uncredited documents based on writing style
  • AIFL’s Professor Tim Grant and Dr Krzysztof Kredens are experts in authorship analysis
  • Applications may include identifying counterintelligence risks, combating misinformation online, fighting human trafficking and even deciphering authorship of ancient religious texts


Aston University’s Institute for Forensic Linguistics (AIFL) is part of the AUTHOR research consortium, which has won an $11.3 million contract to infer the authorship of uncredited documents based on writing style.


The acronym stands for ‘Attribution, and Undermining the Attribution, of Text while providing Human-Oriented Rationales’. Worth $1.3 million, the Aston University part of the project is being led by Professor Tim Grant and Dr Krzysztof Kredens, both of whom are recognised internationally as experts in authorship analysis and engage in forensic linguistic casework as expert witnesses.


In addition to their recognised general expertise and experience in this area, Professor Grant has specific expertise in using linguistic analysis to enhance online undercover policing and Dr Kredens has led projects to develop authorship identification techniques involving very large numbers of potential authors.


The AUTHOR team is led by Charles River Analytics and is one of six teams of researchers selected for the Human Interpretable Attribution of Text Using Underlying Structure (HIATUS) programme, sponsored by the Intelligence Advanced Research Projects Activity (IARPA). The programme uses natural language processing techniques and machine learning to create stylistic fingerprints that capture the writing style of specific authors.


On the flip side is authorship privacy - mechanisms that can anonymize identities of authors, especially when their lives are in danger. Pitting the attribution and privacy teams against each other will hopefully motivate each, says Dr Terry Patten, principal scientist at Charles River Analytics and principal investigator of the AUTHOR consortium.


“One of the big challenges for the programme and for authorship attribution in general is that the document you’re looking at may not be in the same genre or on the same topic as the sample documents you have for a particular author,” Patten says. “The same applies to languages: we might have example articles for an author in English but need to match the style even if the document at hand is in French. Authorship privacy too has its challenges: users must obfuscate the style without changing the meaning, which can be difficult to execute.”


In the area of authorship attribution, the research and casework experience from Aston University will assist the team in identifying and using a broad spectrum of authorship markers. Authorship attribution research has more typically looked for words and their frequencies as identifying characteristics. However, Professor Grant’s previous work on online undercover policing has shown that higher-level discourse features - how authors structure their interactions - can be important ‘tells’ in authorship analysis.
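To make the word-frequency approach mentioned above concrete, here is a minimal sketch in Python. It is an illustration only and not the AUTHOR consortium's method: the list of function words, the helper names (`profile`, `cosine`, `attribute`) and the use of cosine similarity are all assumptions chosen for brevity.

```python
from collections import Counter
import math
import re

# Hypothetical feature set: a handful of common English function words.
# Real systems use far larger and more varied sets of features.
FUNCTION_WORDS = ["the", "of", "and", "to", "a", "in", "that", "is", "it", "but"]


def profile(text):
    """Return the relative frequency of each function word in the text."""
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    total = len(tokens) or 1  # avoid division by zero on empty input
    return [counts[word] / total for word in FUNCTION_WORDS]


def cosine(u, v):
    """Cosine similarity between two equal-length frequency vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def attribute(disputed, candidates):
    """Return the candidate author whose known writing is most similar.

    `candidates` maps an author name to a sample of that author's text.
    """
    target = profile(disputed)
    return max(candidates, key=lambda name: cosine(target, profile(candidates[name])))
```

Calling `attribute(disputed_text, {"Author A": sample_a, "Author B": sample_b})` returns the name whose sample has the closest function-word profile. A profile like this ignores the higher-level discourse features described above, which is the kind of limitation that motivates looking beyond word counts.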


The growth of natural language processing (NLP) and one of its underlying techniques, machine learning, is motivating researchers to harness these new technologies in solving the classic problem of authorship attribution. The challenge, Patten says, is that while machine learning is very effective at authorship attribution, “deep learning systems that use neural networks can’t explain why they arrived at the answers they did.”


Evidence in criminal trials can’t afford to hinge on such black-box systems. It’s why the core condition of AUTHOR is that it be “human-interpretable.” Dr Kredens has carried out research into how explanations can be drawn out of black-box authorship attribution systems, so that the findings of such systems can be integrated into linguistic theory about who we are as linguistic individuals.


Initially, the project is expected to focus on feature discovery: beyond words, what features can we discover to increase the accuracy of authorship attribution?


The project has a range of promising applications – identifying counterintelligence risks, combating misinformation online, fighting human trafficking, and even figuring out the authorship of ancient religious texts.


Professor Grant said: “We were really excited to be part of this project both as an opportunity to develop new findings and techniques in one of our core research areas, and also because it provides further recognition of AIFL’s international reputation in the field.” Dr Kredens added: “This is a great opportunity to take our cutting-edge research in this area to a new level.”


Professor Simon Green, Pro-Vice-Chancellor for Research, commented: “I am delighted that the international consortium bid involving AIFL has been successful. As one of Aston University’s four research institutes, AIFL is a genuine world-leader in its field, and this award demonstrates its reputation globally. This project is a prime example of our capacities and expertise in the area of technology, and we are proud to be a partner.”


Patten is excited about the promise of AUTHOR as it is poised to make fundamental contributions to the field of NLP. “It’s really forcing us to address an issue that’s been central to natural language processing,” Patten says. “In NLP and artificial intelligence in general, we need to find a way to build hybrid systems that can incorporate both deep learning and human-interpretable representations. The field needs to find ways to make neural networks and linguistic representations work together.”


“We need to get the best of both worlds,” Patten says.


The team includes some of the world’s foremost researchers in authorship analysis, computational linguistics, and machine learning from Illinois Institute of Technology, Aston Institute for Forensic Linguistics, Rensselaer Polytechnic Institute, and Howard Brain Sciences Foundation.
