Skip to content

DAEMA: Denoising Autoencoder with Mask Attention

Missing data is a recurrent and challenging problem, especially when using machine learning algorithms for real-world applications. For this reason, missing data imputation has become an active research area, in which recent deep learning approaches have achieved state-of-the-art results. We propose DAEMA: Denoising Autoencoder with Mask Attention, an algorithm based on a denoising autoencoder architecture with an attention mechanism.
While most imputation algorithms use incomplete inputs as they would use complete data – up to basic preprocessing (e.g. mean imputation) – DAEMA leverages a mask-based attention mechanism to focus on the observed values of its inputs.
We evaluate DAEMA both in terms of reconstruction capabilities and downstream prediction and show that it achieves superior performance to state-of-the-art algorithms on several publicly available real-world datasets under various missingness settings.

The paper won the third-best paper award of ICANN 2021! It is freely accessible in its preprint form: https://arxiv.org/abs/2106.16057.

Simon Tihon*, Muhammad Usama Javaid*, Damien Fourure, Nicolas Posocco, Thomas Peel, DAEMA: Denoising Autoencoder with Mask Attention, In Proc. of the The 30th International Conference on Artificial Neural Networks, 2021.

* equal contributions

Watch the presentation on YouTube.

Releated Posts

Evaluation of GraphRAG Strategies for Efficient Information Retrieval

Traditional RAG systems struggle to capture relationships and cross-references between different sources unless explicitly mentioned. This challenge is common in real-world scenarios, where information is often distributed and interlinked, making graphs a more effective representation. Our work provides a technical contribution through a comparative evaluation of retrieval strategies within GraphRAG, focusing on context relevance rather than abstract metrics. We aim to offer practitioners actionable insights into the retrieval component of the GraphRAG pipeline.
Read More

Flight Load Factor Predictions based on Analysis of Ticket Prices and other Factors

The ability to forecast traffic and to size the operation accordingly is a determining factor, for airports. However, to realise its full potential, it needs to be considered as part of a holistic approach, closely linked to airport planning and operations. To ensure airport resources are used efficiently, accurate information about passenger numbers and their effects on the operation is essential. Therefore, this study explores machine learning capabilities enabling predictions of aircraft load factors.
Read More