IEEE P3168

IEEE Approved Draft Standard for Robustness Evaluation Test Methods for a Natural Language Processing Service that uses Machine Learning

standard by IEEE , 06/19/2024

View all product details

Most Recent

Track It

- Available Formats
  ℹ
- Options
- Availability
- Priced From ( in USD )
- PDF
- 👥
- P3168/D3, Aug 2023 - APPROVED DRAFT
- $56.00
- Add to Cart

Customers Who Bought This Also Bought

IEEE C62.21-2003
Priced From $110.00
IEEE/ISO/IEC 29148-2018
Priced From $145.00
IEEE C37.122.6-2013
Priced From $81.00
IEEE 1854-2019
Priced From $83.00

Full Description

Scope

This standard specifies test methods for evaluating the robustness of a Natural Language Processing (NLP) service that uses machine learning. Models of NLP generally feature an input space being discrete and an output space being almost infinite in some tasks. The robustness of the NLP service is affected by various perturbations including adversarial attacks. A methodology to categorize the perturbations, and test cases for evaluating the robustness of an NLP service against different perturbation categories is specified. Metrics for robustness evaluation of an NLP service are defined. NLP use cases and corresponding applicable test methods are also described.

Purpose

The purpose of the standard is to provide test methods for evaluating the robustness of an NLP service. Test methods are used by service developers, service providers and service users to determine the robustness of an NLP service.

Abstract

New IEEE Standard - Active - Draft. The Natural Language Processing (NLP) services using machine learning have rich applications in solving various tasks, and have been widely deployed and used, usually accessible by API calls. The robustness of the NLP services is challenged by various well-known general corruptions and adversarial attacks. Examples of general corruptions include inadvertent or random deletion, addition, or repetition of characters or words. Adversarial attacks generate adversarial characters, words or sentence samples causing the models underpinning the NLP services to produce incorrect results. This standard proposes a method for quantitatively evaluating the robustness the NLP services. Under the method, different cases the evaluation needs to perform against are specified. Robustness metrics and their calculation are defined. With the standard, the service stakeholders including the service developer, service providers, and service users can develop understanding of the robustness of the services. The evaluation can be performed during various phases in the life cycle of the NLP services, the testing phase, in the validation phase, after deployment, etc.

Product Details

Published:: 06/19/2024
ISBN(s):: 9798855704785
Number of Pages:: 27
File Size:: 1 file , 1.3 MB
Product Code(s):: STDAPE26745
Note:: This product is unavailable in Russia, Belarus

Device/OS:	Other
Browser:	Generic Browser 0.0
User Agent:	Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; [email protected])
Store Name:	css
Page:	/standards/ieee-p3168?product_id=2580337
Referrer:	www.techstreet.com./standards/ieee-p3168?product_id=2580337
IP:	18.222.107.236, 172.71.254.202
Language:	en
Customer #:	Not Logged In
Member?:	YES
Cart #:	1159092549
Order #:	None
Cookies:	YES
×

Our policy towards the use of cookies

What is a Secured PDF?

What does this mean?

What can you do with a Secured PDF?

Restrictions: