Towards Transfer Learning Techniques-BERT, DistilBERT, BERTimbau, and DistilBERTimbau for Automatic Text Classification from Different Languages: A Case Study.

Researchers

Ademar Takeo Akabane, Rafael Silva Barbon

Journal

Modalities

Models

BERT, BERTimbau, DistilBERT, DistilBERTimbau

Abstract

The Internet of Things is a paradigm that interconnects smart devices through the internet to provide ubiquitous services to users. Together with Web 2.0 platforms, it generates vast amounts of textual data, which makes automatic text classification a significant challenge. State-of-the-art results have recently been obtained with language models trained from scratch on corpora of online news. One notable model is BERT (Bidirectional Encoder Representations from Transformers); another is DistilBERT, a smaller pre-trained general-purpose language representation model. Through a case study, we apply these two models to text classification in two languages (English and Brazilian Portuguese) on different datasets. The results show that, for both English and Brazilian Portuguese, DistilBERT trained about 45% faster than its larger counterpart, was 40% smaller, and preserved about 96% of the larger model's language comprehension skills on balanced datasets.
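The three relative figures in the abstract (training speed, model size, retained performance) can be illustrated with a short calculation. The sketch below is only an illustration, not the authors' evaluation code: the function names are hypothetical, "45% faster" is assumed to mean a 45% reduction in training time, and the training times and scores are made-up placeholder values chosen to reproduce the reported ratios. The parameter counts are the commonly cited approximate sizes of BERT-base and DistilBERT.

```python
def relative_reduction(baseline: float, candidate: float) -> float:
    """Fraction by which `candidate` is smaller than `baseline`."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (baseline - candidate) / baseline


def retained_fraction(baseline_score: float, candidate_score: float) -> float:
    """Fraction of the baseline score that the candidate preserves."""
    if baseline_score <= 0:
        raise ValueError("baseline_score must be positive")
    return candidate_score / baseline_score


# Approximate parameter counts for BERT-base and DistilBERT.
bert_params, distil_params = 110e6, 66e6

# Placeholder measurements (assumptions, NOT values from the paper).
bert_minutes, distil_minutes = 100.0, 55.0
bert_score, distil_score = 0.90, 0.864

size_cut = relative_reduction(bert_params, distil_params)
time_cut = relative_reduction(bert_minutes, distil_minutes)
kept = retained_fraction(bert_score, distil_score)

print(f"size reduction: {size_cut:.0%}")
print(f"training-time reduction: {time_cut:.0%}")
print(f"performance retained: {kept:.0%}")
```

Swapping in measured training times and evaluation scores for each language and dataset would give the corresponding ratios for that run.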


