About: With the rising success of adversarial attacks on many NLP tasks, systems which actually operate in an adversarial scenario need to be reevaluated. For this purpose, we pose the following research question: How difficult is it to fool automatic short answer grading systems? In particular, we investigate the robustness of the state of the art automatic short answer grading system proposed by Sung et al. towards cheating in the form of universal adversarial trigger employment. These are short token sequences that can be prepended to students’ answers in an exam to artificially improve their automatically assigned grade. Such triggers are especially critical as they can easily be used by anyone once they are found. In our experiments, we discovered triggers which allow students to pass exams with passing thresholds of [Formula: see text] without answering a single question correctly. Furthermore, we show that such triggers generalize across models and datasets in this scenario, nullifying the defense strategy of keeping grading models or data secret.

Facets (new session)
Description
Metadata
Settings
- owl:sameAs
- Inference Rule:

About: With the rising success of adversarial attacks on many NLP tasks, systems which actually operate in an adversarial scenario need to be reevaluated. For this purpose, we pose the following research question: How difficult is it to fool automatic short answer grading systems? In particular, we investigate the robustness of the state of the art automatic short answer grading system proposed by Sung et al. towards cheating in the form of universal adversarial trigger employment. These are short token sequences that can be prepended to students’ answers in an exam to artificially improve their automatically assigned grade. Such triggers are especially critical as they can easily be used by anyone once they are found. In our experiments, we discovered triggers which allow students to pass exams with passing thresholds of [Formula: see text] without answering a single question correctly. Furthermore, we show that such triggers generalize across models and datasets in this scenario, nullifying the defense strategy of keeping grading models or data secret. Goto Sponge NotDistinct Permalink

An Entity of Type : fabio:Abstract, within Data Space : covidontheweb.inria.fr associated with source document(s)

Attributes	Values
type	abstract
value	With the rising success of adversarial attacks on many NLP tasks, systems which actually operate in an adversarial scenario need to be reevaluated. For this purpose, we pose the following research question: How difficult is it to fool automatic short answer grading systems? In particular, we investigate the robustness of the state of the art automatic short answer grading system proposed by Sung et al. towards cheating in the form of universal adversarial trigger employment. These are short token sequences that can be prepended to students’ answers in an exam to artificially improve their automatically assigned grade. Such triggers are especially critical as they can easily be used by anyone once they are found. In our experiments, we discovered triggers which allow students to pass exams with passing thresholds of [Formula: see text] without answering a single question correctly. Furthermore, we show that such triggers generalize across models and datasets in this scenario, nullifying the defense strategy of keeping grading models or data secret.
Subject	Tests Artificial intelligence Patent law Psychometrics Educational psychology Computational linguistics Sports science School examinations
part of	Fooling Automatic Short Answer Grading Systems
is abstract of	Fooling Automatic Short Answer Grading Systems
is hasSource of	covid:ann/target/9a722944321f1547b78fa69056d797c8e9b55af6 covid:ann/target/bc8fb2837794983344adaf635943cd5e9f3fb4af covid:ann/target/8f83f87af7d103f93165aeeaf380935b8ee966f8 covid:ann/target/68565997fecb15c74c90398faeca46bc3a8ca0ae covid:ann/target/c8dda620488139ed95f0aabceb792f34c94fe488 covid:ann/target/db4b9d33cd7857ebece54b20697a0ac9714be826 covid:ann/target/19a3656abfdb0473a52d458bc7677a3728c59f9a covid:ann/target/31a5a6d8f392237784cc04a7c108f2ee0b0d6d0c covid:ann/target/c7c5332fce9ec03a3698bc01ac4c4fc2da59581b covid:ann/target/e670d15e419598addd2f71e8a90837496b593309 covid:ann/target/195404ed79ee2f727e966d4f670b3fce5b80c7e4 covid:ann/target/124fbae00f516947097e91d7df28f2d48245595f covid:ann/target/5b1b483b7f9c3273533b82424eaf5d746337a0e6 covid:ann/target/b5609cd6826b2aa8698dc97bc7810157f173f463 covid:ann/target/e24d10feda2a34bbbe1cd23adf4ccf7f2fe6edc7 covid:ann/target/cae0f40f6897680c6f8b60a2119bb2308653026c covid:ann/target/93a09f9f11d824dd36dcf23656142df6f28b5f96 covid:ann/target/0b30ce71c1556a2d0b35160a7b27c757ab4202d9 covid:ann/target/fbcec95849c05ddba7d7a9e1f11da0eff7c4ad35 covid:ann/target/db8ee17b4d3787564da358a1da6a3481701d238c covid:ann/target/90480d8790721e7e88920aec75a9bbcdfd6c2ce3 covid:ann/target/cceed635744a72594157214e0a4d59ae50ee873b covid:ann/target/93967ce46f81611e3bbb34ac5dc79a2d3d71d18f covid:ann/target/d1ad350eb934a3e67b4432406a1b04c7fcfef038 covid:ann/target/65f73bc620a2ad6ff90c5ebfabe35f04bca44b6c covid:ann/target/8ebb6986510ffe8f3746fbd830491e567dcd84c1 covid:ann/target/d07bcbaa7f3a068abd8f213cdc7e84723b7fad98 covid:ann/target/059cee16afa3723e9ff4990a1825584ad5977c1b covid:ann/target/972106bcb67c4bde4d6077ca3991924d22eb1775 covid:ann/target/66fb79897b24e8fd662af4e191c7ac13ef361cd3 covid:ann/target/c620fca203b6b1b1d1c11c64f430b96029a47e5b covid:ann/target/e1f6fc7aff10b71c5cf98d3fc202f43ad7a0acab covid:ann/target/bfce717e6b943fe7c20adec35f70008abbd61d0e covid:ann/target/09d0ff14c316be938d601a70eeac506841515b31 covid:ann/target/6a29888ef9b8002ddecfa64471b9c3f48d68f735 covid:ann/target/300a0ed04a8a8822e6c21d73ca3be38b060b15ef covid:ann/target/446d3c52812449d7ea1731fb06357b01da707f7f covid:ann/target/40342ee016d9a0722d6bc9237b8ce3f9a5b5057f covid:ann/target/77760d0e8388ded9d3c189b98caba2104d39bea3 covid:ann/target/8270707dc0a251e8963ad7e45429446fcd58a938 covid:ann/target/6e539993586d3a125c58f401f162bdb47e1cd59a covid:ann/target/3d1d984d3132408450865c22f1182b9efd194dda covid:ann/target/ad577ee25564b9e925c54b2bb0314eef6786df7d covid:ann/target/2c5183643686864a112c670ae42c9d0205757400 covid:ann/target/9112d7888a5d124a0da4eee300ad81789f145775 covid:ann/target/7712d6b97be84628d690d546a0197f61be0246fe covid:ann/target/24f9cc8a73fd81202a9032fe22e598a53ed59f95 covid:ann/target/f6d770da5a191b3215b4438b1917b39390870e06 covid:ann/target/96526c1e10beefa273e659d8ec34bfc95d3d8a0a covid:ann/target/1b9fadda6c44b2e83350e15e627c4acc327db4a5 covid:ann/target/7eb72eb9eaf3a08ed038d9cc40a90c01235d8ed0 covid:ann/target/01ede4e18abe5886df48d6834481f6c6ae3d5729 covid:ann/target/101899d9cde0fc2ccf0c9ff34a43eba0935a2916 covid:ann/target/1d0523be9d055863259c8b27c3b78b47ad10dad3 covid:ann/target/235c6060d1becc26bb8b816b6a653c2fb9404691 covid:ann/target/5ed7558df252aca582fe2e1f721438f460c6c463 covid:ann/target/f6d015dbda302824e83f579a4e20a47fcfcb45e8 covid:ann/target/2c280f7a8ba9502c834d653a9a14342c57bd451d covid:ann/target/331966fa106eb9641772a88a65872c045d8cc3f3 covid:ann/target/2c7fa1402d7139d20a2082300470c56e0d643118 covid:ann/target/ae42560477e10f2080199bccf9aa3c5cc695ac17 covid:ann/target/261fd65c26b3c3819d2d5b97d2b52bfde5fbf91a covid:ann/target/d8cf006ef535575d855db3d6bdaf508a17a605a3 covid:ann/target/00cefbfe5ee75ed7543fcaf4a1ae6b1e68989562 covid:ann/target/c0bcfdd0582274ee426a7f4b0c64012a2bde1687 covid:ann/target/e00a4dfe2129249d52f954ed0dd5102102b17a6b covid:ann/target/6b4b80d09aeaaf8c9685c495df8a5d2d4f062bbd covid:ann/target/962f1f328e52bd72a92e816bbb0b972744d875b7 covid:ann/target/b90425e6d9e18b2ee11ba8bd1eb5f6f933a6f579 covid:ann/target/bb564c0c12c5317a2f6c788cc59d9ddbd2843001 covid:ann/target/161182a6a7d67ee6cab6d49617daefdb58cfac93 covid:ann/target/9b43ad8c37075363abbb794459552ff34e4776fd covid:ann/target/f7356293ae1004ad5dab3d392d06dd065534abc3 covid:ann/target/6abc48a4751cd3b86c4c92e22c5587ed4bd99d89 covid:ann/target/03e559f3e5e075ad97e6eab63d9e57ea0ef8a393 covid:ann/target/6ca423bdeed92aa9e7a45e6b7f55eaa42fb39626 covid:ann/target/0bf73282a0670a054ad3785a7233d05de89cf159 covid:ann/target/39719a476829c568efc3b1f320ab95c1957ac912 covid:ann/target/513693abddbb92eb76a59fb140ff3b2f3da1f46e covid:ann/target/0a536981393e75d40eadef027a61202791337b0b covid:ann/target/840de7e4601e85a615bba58e8c1c43c22e55cbf3 covid:ann/target/888cbef7b79266196684616e6b715885232b1abc covid:ann/target/dc6dba0b831576552833fcda936c5f760ba907b4 covid:ann/target/e4f12a45a0b53298e63b33e8539d16e90140970a covid:ann/target/333e3c9c45f8d31ce96d6caa79ff29f049541da0 covid:ann/target/4907347eb71bda74ee36c1fdab9d202849ccdabf covid:ann/target/d9138396458b57cdcc14f15708b547a7123159dd covid:ann/target/071706cb4146bbb573f688b3d8104317cb11ca42

Faceted Search & Find service v1.13.91 as of Mar 24 2020

Alternative Linked Data Documents: Sponger | ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 07.20.3229 as of Jul 10 2020, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (94 GB total memory)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software