GLOBAL RESEARCH SYNDICATE
No Result
View All Result
  • Login
  • Latest News
  • Consumer Research
  • Survey Research
  • Marketing Research
  • Industry Research
  • Data Collection
  • More
    • Data Analysis
    • Market Insights
  • Latest News
  • Consumer Research
  • Survey Research
  • Marketing Research
  • Industry Research
  • Data Collection
  • More
    • Data Analysis
    • Market Insights
No Result
View All Result
globalresearchsyndicate
No Result
View All Result
Home Data Collection

Why Congress, DoD should focus on training data platforms to make AI tools more valuable

globalresearchsyndicate by globalresearchsyndicate
June 15, 2020
in Data Collection
0
Why Congress, DoD should focus on training data platforms to make AI tools more valuable
0
SHARES
1
VIEWS
Share on FacebookShare on Twitter

All artificial intelligence, whether it be for fighting coronavirus or fighting future drone swarms, currently depends on one thing: Quality training data. Good data means the difference between missing a moving target and hitting it squarely between the eyes. And developing good data requires a training data platform (TDP), software designed to manage vast amounts of data so that it can be read by AI systems.

While data scientists across the government and private sector know this well, it is imperative that Congress and senior military leaders understand it, too, because collecting and preparing quality training data takes time and money. Allocating time and money, meanwhile, requires informed leaders.

The Pentagon’s Joint Artificial Intelligence Center (JAIC) is creating a platform that will provide Defense Department data scientists access to datasets, code libraries and other certified platforms to speed development and deployment of AI-enabled systems.

The National Security Commission on AI, meanwhile, has recommended that Congress establish a National AI Research Resource that would include a searchable collection of datasets available for the development of machine-learning models for national security solutions.




Both of these initiatives are critical if this country hopes to maintain its advantage over adversaries, particularly China, because AI will likely determine which country wins in the economic realm and the national security realm.

There are three basic components to AI as we know it today: algorithms, data and computational power.

Algorithms are largely in the public domain.

Computational power is ubiquitous thanks to cloud providers such as Amazon Web Services, which allow anyone with a credit card and an Internet connection to access massive collections of high-speed computers from anywhere in the world.

Data, particularly labeled data, is the most critical and most proprietary piece in AI systems.

There are many kinds of AI, but currently the most effective – what almost everyone is talking about when they say “AI” – is supervised learning. In supervised learning, networks of algorithms, written in massive blocks of computer code, are taught what patterns they should recognize, whether it be enemy camps in drone footage or signs that a truck is about to break down.

In order to teach the algorithms what to look for, they are fed tens of thousands, even millions, of data points carefully labeled by humans. After seeing thousands of military encampments, for example, along with thousands of images that may look like military encampments, but are not, the algorithms become expert at spotting the real thing faster and more accurately than humans.

Preparing data for machine intelligence requires a TDP built to keep thousands, tens of thousands or millions of data files organized with an intuitive interface that follows many of the conventions of consumer software. It coordinates access to that data by hundreds or thousands of human labelers.

But a good TDP does much more: It gives data scientists the ability to discover bias in datasets and correct them. For example, by monitoring imbalances in the dataset it can alert data science teams of the need to collect more data on so-called corner cases, relatively rare situations that nonetheless algorithms should learn to recognize.

A good TDP itself learns what to look for in the data and pre-labels data so labelers need only verify accuracy, speeding the process. It allows for easy training of data labelers and provides quality control features that can identify labelers who are making errors and require more training. And a good TDP allows version control of datasets and creates an audit trail so that data science teams can roll back the dataset if its accuracy drifts, or spot where problematic changes occurred.

China will soon produce more data each year than the any other country, according to international market research firm International Data Corp. Thanks to its “military-industrial fusion,” much of the data collected by pervasive commercial services is available to the country’s national security establishment.

The JAIC’s Joint Foundation Center is a good start toward countering that advantage. The NSCAI’s recommendation to create a National AI Research Resource would be an even bigger step.

But not only does the national security establishment need data, searchable and accessible rather than siloed, it needs to label that data appropriately. The JAIC knows how to do this, as do other discreet teams throughout the Defense Department. But the intelligence community remains married to sophisticated labeling protocols that are not machine readable and not useful for AI models.

There are roughly 18,000 analysts in the U.S. intelligence communities, many of whom peruse carefully labeled data that has been collected now for decades. But 18,000 analysts are not enough to capture the insights coming from all of the data being collected today.

Satellites capture images of every point on earth daily. Thousands of manned surveillance flights and unmanned drones record images of video feeds from all over the world, some with a resolution as fine as a few centimeters. Fused with chatlogs, phone intercepts, radio traffic and emails, this data can give the U.S. remarkable, near real-time visibility about what is happening in the world. AI tools now exist to scan all of that data and flag anomalies, narrowing the space for human analysts to focus on.

But in order to train AI systems to do the work, a subset of that data needs to be appropriately labeled – not for humans, but for machines. Already, the Defense Department is labeling drone footage for AI. Project Maven is the best-known effort.

But the intelligence communities continue to work with electronic light tables to produce data that, while in many ways more sophisticated than standard AI data, are not consumable by AI systems. The national security establishment would benefit greatly from a machine-readable labeling protocol that would fit unobtrusively into the intelligence community’s current practice.

The pre-labeling feature of a good TDP could be adapted to take human labeled data from the intelligence community’s electronic light tables and pre-label it for AI systems, making use of the decades of legacy data labeled for human analysts.

Quality labeled datasets are the key to the accuracy of AI systems. The national security establishment needs a uniform labeling process to ensure datasets meet quality standards that will make U.S. AI systems as accurate as possible. Congress should adopt the NSCAI’s recommendations for a National AI Research Resource in the 2020 National Defense Authorization Act and standardize data labeling across the US government.

Manu Sharma is co-founder and CEO of Labelbox, an AI platform development company, and an aerospace engineer.

Related Posts

How Machine Learning has impacted Consumer Behaviour and Analysis
Consumer Research

How Machine Learning has impacted Consumer Behaviour and Analysis

January 4, 2024
Market Research The Ultimate Weapon for Business Success
Consumer Research

Market Research: The Ultimate Weapon for Business Success

June 22, 2023
Unveiling the Hidden Power of Market Research A Game Changer
Consumer Research

Unveiling the Hidden Power of Market Research: A Game Changer

June 2, 2023
7 Secrets of Market Research Gurus That Will Blow Your Mind
Consumer Research

7 Secrets of Market Research Gurus That Will Blow Your Mind

May 8, 2023
The Shocking Truth About Market Research Revealed!
Consumer Research

The Shocking Truth About Market Research: Revealed!

April 25, 2023
market research, primary research, secondary research, market research trends, market research news,
Consumer Research

Quantitative vs. Qualitative Research. How to choose the Right Research Method for Your Business Needs

March 14, 2023
Next Post
Wireless Audio Speakers MARKET (IMPACT OF COVID-19) SEGMENTATION, SWOT ANALYSIS, OPPORTUNITIES AND FORECAST TO 2025 |  Texas Instruments, Samsung, Sony, HP, Creative

Trending News:Covid-19 impact on Chlamydia Infection Market Research Report Analysis and Forecast till 2025| – 3w Market News Reports

Categories

  • Consumer Research
  • Data Analysis
  • Data Collection
  • Industry Research
  • Latest News
  • Market Insights
  • Marketing Research
  • Survey Research
  • Uncategorized

Recent Posts

  • Ipsos Revolutionizes the Global Market Research Landscape
  • How Machine Learning has impacted Consumer Behaviour and Analysis
  • Market Research: The Ultimate Weapon for Business Success
  • Privacy Policy
  • Terms of Use
  • Antispam
  • DMCA

Copyright © 2024 Globalresearchsyndicate.com

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
This website uses cookies to improve your experience. We'll assume you're ok with this, but you can opt-out if you wish. Cookie settingsACCEPT
Privacy & Cookies Policy

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these cookies, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may have an effect on your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Non-necessary
Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.
SAVE & ACCEPT
No Result
View All Result
  • Latest News
  • Consumer Research
  • Survey Research
  • Marketing Research
  • Industry Research
  • Data Collection
  • More
    • Data Analysis
    • Market Insights

Copyright © 2024 Globalresearchsyndicate.com