publications
Publications by category in reverse chronological order. Generated by jekyll-scholar.
2024
- When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards. arXiv preprint arXiv:2402.01781, 2024
- A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations. arXiv preprint arXiv:2407.04069, 2024
2023
- Mixture of domain experts for language understanding: An analysis of modularity, task performance, and memory tradeoffs. In 2022 IEEE Spoken Language Technology Workshop (SLT), 2023
- Low-Resource Compositional Semantic Parsing with Concept Pretraining. arXiv preprint arXiv:2301.09809, 2023
- Controlling the Extraction of Memorized Data from Large Language Models via Prompt-Tuning. arXiv preprint arXiv:2305.11759, 2023
2022
- RescoreBERT: Discriminative Speech Recognition Rescoring With BERT. In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
- Unfreeze with Care: Space-Efficient Fine-Tuning of Semantic Parsing Models. In Proceedings of the ACM Web Conference 2022, 2022
- Controlled Data Generation via Insertion Operations for NLU. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, 2022
- Alexa teacher model: Pretraining and distilling multi-billion-parameter encoders for natural language understanding systems. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2022
- AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model. arXiv preprint arXiv:2208.01448, 2022
2021
- ASR N-Best Fusion Nets. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021
- Output Randomization: A Novel Defense for both White-box and Black-box Adversarial Models. arXiv preprint arXiv:2107.03806, 2021
- Limitations of Knowledge Distillation for Zero-shot Transfer Learning. In Proceedings of the Second Workshop on Simple and Efficient Natural Language Processing, 2021
2020
- Don’t Parse, Insert: Multilingual Semantic Parsing with Insertion Based Decoding. arXiv preprint arXiv:2010.03714, 2020
- Compressing Transformer-Based Semantic Parsing Models using Compositional Code Embeddings. arXiv preprint arXiv:2010.05002, 2020
- Using multiple ASR hypotheses to boost i18n NLU performance. arXiv preprint arXiv:2012.04099, 2020
2019
- Deep density ratio estimation for change point detection. arXiv preprint arXiv:1905.09876, 2019
- Thwarting finite difference adversarial attacks with output randomization. arXiv preprint arXiv:1905.09871, 2019
- Predicting Change Points in Multivariate Time Series Data. Rensselaer Polytechnic Institute, 2019
- Optimal Mini-Batch Size Selection for Fast Gradient Descent. arXiv preprint arXiv:1911.06459, 2019
- Generation & Evaluation of Adversarial Examples for Malware Obfuscation. In 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA), 2019
2018
- Learning filter widths of spectral decompositions with wavelets. Advances in Neural Information Processing Systems, 2018
2017
- Focal onset seizure prediction using convolutional networks. IEEE Transactions on Biomedical Engineering, 2017
2014
- Interaction Recognition Using Sparse Portraits. In 2014 22nd International Conference on Pattern Recognition, 2014