Exploring the Utility of Developer Exhaust.

Researchers

Christopher Ré Jian Zhang Kunle Olukotun Luigi Nardi Max Lam Paroma Varma Stephanie Wang

Journal

Proceedings of the Second Workshop on Data Management for End-to-End Machine Learning. Workshop on Data Management for End-to-End Machine Learning (2nd : 2018 : Houston, Tex.)

Modalities

Models

LSTM

Abstract

Using machine learning to analyze data often results in developer exhaust – code, logs, or metadata that do not define the learning algorithm but are byproducts of the data analytics pipeline. We study how the rich information present in developer exhaust can be used to approximately solve otherwise complex tasks. Specifically, we focus on using log data associated with training deep learning models to perform model search by predicting performance metrics for untrained models. Instead of designing a different model for each performance metric, we present two preliminary methods that rely only on information present in logs to predict these characteristics for different architectures. We introduce (i) a nearest neighbor approach with a hand-crafted edit distance metric to compare model architectures and (ii) a more generalizable, end-to-end approach that trains an LSTM using model architectures and associated logs to predict performance metrics of interest. We perform model search optimizing for best validation accuracy, degree of overfitting, and best validation accuracy given a constraint on training time. Our approaches can predict validation accuracy within 1.37% error on average, while the baseline achieves 4.13% by using the performance of a trained model with the closest number of layers. When choosing the best performing model given constraints on training time, our approaches select the top-3 models that overlap with the true top- 3 models 82% of the time, while the baseline only achieves this 54% of the time. Our preliminary experiments hold promise for how developer exhaust can help learn models that can approximate various complex tasks efficiently.

Show Full Text

Exploring the Utility of Developer Exhaust.

Researchers

Journal

Modalities

Models

Abstract

Exploring the Potential of Artificial Intelligence and Machine Learning to Combat COVID-19 and Existing Opportunities for LMIC: A Scoping Review.

The Emerging Trends of Multi-Label Learning.

Deep Learning-Based Concrete Surface Damage Monitoring Method Using Structured Lights and Depth Camera.

Validating the validation: reanalyzing a large-scale comparison of deep learning and machine learning models for bioactivity prediction.

Using UAV Images and Deep Learning in Investigating Potential Breeding Sites of Aedes albopictus.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply