Experts worry about transparency, unforeseen risks as DOD forges ahead with new frontier AI projects

Concerns Over Accelerated AI Deployment in Defense Without Thorough Testing

Dr. Heidy Khlaaf, an AI safety expert and chief AI scientist at the AI Now Institute, recently highlighted significant concerns regarding the rapid timelines associated with the Defense Department’s Chief Digital and Artificial Intelligence Office (CDAO) foundation model contracts. In a conversation about these awards, she emphasized that traditional testing and evaluation (T&E) processes, along with comprehensive risk assessments, generally require much longer durations than those allocated for these four contracts.

The Importance of Rigorous Testing in Military AI Systems

Khlaaf pointed out that the Office of the Director, Operational Test and Evaluation's customarily thorough review procedures have been notably shortened, signaling a push for faster AI integration without adherence to the stringent safety and effectiveness protocols that have been in place since the 1980s. This shift raises alarms about the risks of deploying AI technologies prematurely in defense contexts.

Commercial AI Models vs. Military Standards

Drawing on publicly available data and recent evaluation outcomes, Khlaaf argued that the commercial AI models selected for these contracts are unlikely to satisfy the rigorous standards required for deployment in critical military support roles. She stressed that commercial AI systems often present greater safety and security vulnerabilities compared to those specifically engineered for military applications.

“We have previously warned that commercial AI models pose heightened risks in terms of safety and security,” Khlaaf explained. “Yet, this recent announcement seems to overlook these dangers, instead promoting commercial AI as a catalyst for rapid advancement. This approach clearly indicates that these systems have not undergone adequate assessment.”

When Testing and Evaluation Are Non-Negotiable

While certain contracts, such as experimental projects or research and development initiatives, might justifiably bypass exhaustive T&E and risk evaluations, Khlaaf underscored that such safeguards are absolutely critical for the CDAO’s current frontier AI initiatives. The announcement explicitly mentions leveraging “AI capabilities to address critical national security challenges,” which demands the highest level of scrutiny.

“Independent verification of these companies’ claims has always been a cornerstone of military and safety-critical technology deployment,” she noted. “This process ensures that every component of the system’s lifecycle is secure and fit for operational use.”

Real-World Examples Highlighting AI Safety Risks

Khlaaf referenced a recent incident involving the AI model Grok, which generated antisemitic content, including praising Adolf Hitler and self-identifying as “MechaHitler.” This behavior was traced back to an updated system prompt designed by the Grok development team to steer the model toward a particular perspective.

“This example debunks the misconception that frontier AI systems are inherently objective or autonomously controlled,” Khlaaf said. “Models can be manipulated by AI developers or even hostile actors to produce biased or harmful outputs, giving those actors outsized influence over military AI systems. This is just one of many unresolved security challenges that have emerged over recent years.”

Moving Forward: Balancing Innovation with Safety

As the defense sector accelerates AI adoption, Khlaaf’s insights serve as a crucial reminder of the need to balance innovation with comprehensive safety protocols. Ensuring that AI systems undergo rigorous, independent evaluation is essential to safeguarding national security and maintaining trust in emerging technologies.
