Test Development & Validation

Selected projects across test development, item writing, benchmarking/alignment, equating, validation, and operational monitoring.
Click test name for external site and role for more information in a drop down box.

Cambridge Assessment LTRC OET logo Linguaskill logo Babel logo CNaVT logo NAATI Kaplan CRELLA EnglishScore Edumaxi logo
Test: CLA Test | Role: Test Development Project Lead | LTRC | Client: NAATI

The Community Language Test (CLA) is a LOTE (Languages other than English) test for interpreters in low-stakes settings. LTRC commissioned by NAATI (National Accreditation Authority for Translators and Interpreters). The test enables people to demonstrate their basic oral communication skills in a language other than English, and their ability to support clients, students or stakeholders in a two-way conversation.

Role: With LTRC guidance and NAATI input and support, I designed, trialled, and benchmarked the test. Additionally, I developed both training materials for examiners and practice / preparation material for candidates.
About this test

LTRC logo NAATI logo
Test: Unnamed Large-Scale Proficiency Exam | Role: Item Development Project Lead | LTRC | Client: (withheld)

In long-term cooperation with the LTRC, this Japanese test developer commissioned item development for their series of tests benchmarked from A1 to B2 across four skill areas. The test is used in a range of contexts, including in primary and secondary school settings, and for university entrance. The test is delivered in large volumes, with regular version renewal cycles, and is used by a wide range of learners across the country. and as university language entrance or placement exams.

Role: I lead this team, which included one other dedicated test developer and item reviewer, and a range of freelance item writers which I recruited and trained. My main duties involved coordinating the team, training and supervising item writers, liaising with the client, and overseeing contract delivery. This was a long-term project lasting from 2016 to 2020.
About this test

LTRC logo
Test: KTE | Role: Item Development Quality Assurance | Client: Kaplan Testing Services (KSE)

This project involved development ofreading and listening texts and items for B2-C2 levels in the Kaplan Test of English (KTE), an adaptive English language proficiency test.

Role: I provided quality assurance for C2-level items, working closely with and providing detailed feedback to the C1 item writer. I was responsible for reviewing items, ensuring the principles of good item writing were followed and the items were leveled appropriately and adhered to the specifications of the test.
About this test

Kaplan logo
Test: DELNA | Role: Item Development and Evaluation | LTRC | Partner: The University of Auckland

The DELNA (Diagnostic English Language Needs Assessment) is a post-entry diagnostic language test developed and used at the University of Auckland and the University of Melbourne. The DELNA is used to identify students who may require further language support to succeed at university.

Role: As part of my work with the LTRC, I helped develop, trial, evaluate and equate several versions of the DELNA (Diagnostic English Language Needs Assessment). This is a post-entry diagnostic language test used at the University of Auckland and the University of Melbourne which is used to identify students who may require further language support to succeed at university.
About this test

LTRC logo University of Auckland logo University of Melbourne logo
Test: OET | Role: New Item Trialing and Paneling | LTRC | Client: Cambridge Boxhill Language Assessments (CBLA)

The Occupational English Test (OET) is a profession-specific English proficiency test for healthcare professionals, used mainly for registration and migration in countries like the UK, Australia, and Ireland.

Role: As part of the LTRC team, I helped with the paneling, trialing and analysis of the new form of the OET, which was changed as part of the handover from the LTRC to CBLA.
About this test

OET logo Cambridge logo
Test: AEST | Role: Item Development, Item Trialing, New Form Equating | LTRC

The online Academic English Screening Test (AEST) is designed to provide a means of identifying those students who are likely to experience difficulties in coping with the English language demands of their academic study. The test can be used as an alternative to the more time consuming DELA. The test was designed on the assumption of universal testing rather than targeting of particular categories of student. A number of Australian universities use the AEST under site licence.

Role: I wrote items for the AEST, which has a c-test part and a cloze-elide text in which extra words are added to a lengthy text. I was also involved in the trialling and revising of the items based on trial data.
About this test

University of Melbourne logo LTRC logo
Test: Unnamed College English Exam | Role: Item Writer | LTRC | Client: (withheld)

As part of a project in cooperation with a Chinese test developer, I was part of a team which developed high-level reading and listening items aligned with the Standards of English (CSE).

Role: Item development, text writing, liaising with overseas client.

LTRC logo
Test: EnglishScore | Role: CEFR Benchmarking and Validation | CRELLA | Client: British Council

EnglishScore provides evidence of proficiency in understanding and using English in everyday life and the workplace. The test is primarily concerned with the occupational, public and personal domains. The test is delivered via a mobile app. Productive scores are reached via the use of both AI and peer raters. The test is benchmarked to the CEFR and developed in partnership with the Centre for Research in English Language Learning and Assessment (CRELLA) at the University of Bedfordshirend.

Role: Working with CRELLA, I was part of a benchmarking project for EnglishScore items. Afterwards, I was recruited by EnglishScore directly for a six month role in which I provided consultation and assistance with other validation activities.
About this test

EnglishScore logo CRELLA logo
Test: Linguaskill | Role: Item Evaluation | Edumaxi | Client: British Council

Linguaskill is a computer-based, on-demand English proficiency test developed by Cambridge Assessment. It tests reading, listening, writing, and speaking for workplace and institutional contexts.

Role: Working with Edumaxi, a New Zealand-based test and curricula developer, I worked with a team to provide Linguaskill and Linguaskill Business with feedback on older items as part of a rejuvination of the test. This involved evaluation, feedback and redrafting of items in the reading and listening sections of the test for several CEFR levels.
About this test

Linguaskill logo Edumaxi logo
Test: AKV en Nederlands voor medici (B2+) | Role: Test Development and Validation | Client: Babel Utrecht

All medical professionals with a foreign education or training need to demonstrate appropriate Dutch and English language skills in order to work in the Netherlands. Before 2024, this was possible by taking the AKV (Algemene Kennis en Vaardigheden) test, which was a test of general Dutch language skills. In 2024, the AKV was replaced by a new test, the AKV en Nederlands voor medici (B2+), which is a test of both general Dutch language skills and medical Dutch language skills. The test is designed to assess the language skills of medical professionals. The AKV was developed and delivered by Babel Utrecht, a language training and testing provider based in the Netherlands. The AKV was discontinued, and a replacement for B2+ requirements was developed by Babel, called Nederlands voor medici (B2+).

Role: Prior to 2024, I was contracted by Babel as an external expert to evaluate the reliability and validity of the AKV, a requirement for the ministry overseeing the registration of medical professionals in the Netherlands. After the AKV was discontinued, I was engaged to provide feedback on test development and design for the replacement test.
More information about the Dutch registry for medical professionals and language requirements.

Babel logo
Test: CNaVT | Role: CEFR Benchmarking | Client: Nederlandse Taalunie

The CNaVT is an international, CEFR-aligned Dutch language proficiency test designed for non-native speakers to demonstrate their ability to use Dutch in academic, professional, or everyday contexts.

Role: As part of ongoing test development and validation, I was invited to take part in an exercise evaluating newly developed lower-level items for their appropriacy as A2-level items.

CNaVT logo Taalunie logo
AEAS — routine item/material development (school sector) Australian Education Assessment Services (AEAS) — 2016–2017

Reading, vocabulary, and listening materials for school-age learners.

Role keywords: Item writer • Materials development • Reading/listening/vocabulary • Team-based production • Version updates (routine)

RADAr — post-entry diagnostic assessment: monitoring, equating & validation Radboud University — 2022–present

Ongoing evaluation of item functioning and form comparability; policy-facing reporting.

Role keywords: Item performance analysis • Reliability analysis • Form equating • Predictive validity analysis • Internal validation report author (2025) • Diagnostic interpretation • Policy reporting

Revision Tools Test (RTT) — diagnostic assessment of writing with automated feedback Radboud University — 2022–present

Diagnostic test of writers’ decisions when interacting with spelling/grammar suggestions and surface-level feedback.

Role keywords: Test designer • Construct innovation (human–AI/tool interaction) • Subskill framework • Item writing • Scoring logic design • Trial study design • Psychometric analysis • Diagnostic feedback framework