Second Spring School Big Data Analytics

EURA NOVA Research Center is both proud and happy to lead the Second Spring School Big Data Analytics that will be held in Tunis, from the 20th to the 22nd of March 2018. Sabri Skhiri and Aymen Cherif will talk about their favorite topics:

  • Deep Learning
  • TensorFlow
  • CNN Architecture
  • Unsupervised Learning
  • Complex Event Processing
  • Stream processing & micro-services

 

Check out the complete agenda and register on the event website: https://sites.google.com/view/ssbda2018/welcome

The conference is organised by the Ecole Polytechnique de Tunisie.

The Next Activities of our R&D Centre in Marseille

The French branch of EURA NOVA will take part in two great tech events in the coming days and weeks.

 

On the 22nd of February, data scientist Thomas Peel will give a talk titled “Machine Learning à l’ère du RGPD” (Machine learning and the General Data Protection Regulation) on the opening day of the Colloquium intelligence artificielle, machine learning, data science to be held at the grand amphitheatre of the Saint-Charles campus in Marseille. Other great speakers from INRIA, Google, Provence Innovation, and Criteo will be featured. The event is free but registration is mandatory.

 

Practical information:

What? Colloquium intelligence artificielle, machine learning, data science

When? Thursday 22nd of February

Where? Grand amphithéâtre, campus Saint-Charles – 3, place Victor Hugo – case 39 – 13331 MARSEILLE Cedex 03

Registration: https://framaforms.org/conferences-ia-data-science-machine-learning-i2mlis-1518019875

 

On the 12th of March, the French branch of EURA NOVA is organising the Marseille Community Event, supported by the Neo4j GraphTour. Two speakers have already been announced: R&D project manager Cécile Péreaira will present a text-mining use case with Neo4j in biology, and data scientist Antoine Bonnefoy will sum up the Parisian Neo4j conference from both technology and business viewpoints. After the talks, all attendees will be offered a casual dinner to continue the discussion.

 

Practical information:

What? Marseille Community Event – Neo4j GraphTour

When? Monday the 12th of March, from 6:30 PM to 8:30 PM

Where? Le Wagon, 167 Rue Paradis, Marseille

Registration: https://www.eventbrite.fr/e/billets-neo4j-graphtour-marseille-community-event-42714338737?utm_campaign=new_event_email&utm_medium=email&utm_source=eb_email&utm_term=viewmyevent_button

Discovering Interesting Patterns in Large Graph Cubes

Due to the increasing importance and volume of highly interconnected data, such as in social or information networks, a plethora of graph mining techniques have been designed to enable the analysis of such data. In this work, we focus on the mining of associations between entity features in networks. We model each entity feature as a dimension to be analyzed. Consequently, we build our approach on top of the existing graph cube framework, which is an extension of the concept of the data cube to networks. Our task is particularly challenging because it requires the analysis of both the initial multidimensional network and all its subsequent aggregate forms. In a big data setting, it is impossible for an analyst to manually consider all the possible views of the network data. The aim of this work is to design an algorithm for the discovery of interesting patterns in large graph cubes. Thus, instead of examining all the possible aggregations manually, the proposed technique leads the analyst to the interesting associations or patterns in the multidimensional network. Furthermore, we study the application of existing algorithms from the frequent itemset mining literature to graph data and propose a mapping between the two settings.
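To give a rough feel for what a mapping between graph data and frequent itemset mining can look like, here is a minimal sketch. It is not the mapping proposed in the paper, whose details are not given in the abstract: the toy network, the `(role, dimension, value)` item encoding, and the helper names `edge_to_transaction` and `frequent_itemsets` are all assumptions made for illustration. Each edge becomes one transaction built from the feature values of its two endpoints, and a brute-force counter keeps the itemsets that reach a minimum support.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy network: node id -> feature (dimension) values.
nodes = {
    1: {"gender": "F", "city": "Paris"},
    2: {"gender": "M", "city": "Paris"},
    3: {"gender": "F", "city": "Tunis"},
    4: {"gender": "M", "city": "Tunis"},
}
# Directed edges as (source, target) pairs.
edges = [(1, 2), (3, 2), (1, 4), (3, 4), (2, 1)]


def edge_to_transaction(src, dst):
    """Encode one edge as a set of (role, dimension, value) items (assumed encoding)."""
    items = {("src", dim, val) for dim, val in nodes[src].items()}
    items |= {("dst", dim, val) for dim, val in nodes[dst].items()}
    return frozenset(items)


def frequent_itemsets(transactions, min_support, max_size=2):
    """Count every itemset of up to max_size items and keep the frequent ones.

    Brute-force enumeration, fine for a toy example only; a real miner
    would prune candidates (Apriori, FP-growth, ...).
    """
    counts = Counter()
    for transaction in transactions:
        for size in range(1, max_size + 1):
            for itemset in combinations(sorted(transaction), size):
                counts[itemset] += 1
    return {s: c for s, c in counts.items() if c >= min_support}


transactions = [edge_to_transaction(s, d) for s, d in edges]
patterns = frequent_itemsets(transactions, min_support=4)
for itemset, support in sorted(patterns.items()):
    print(support, itemset)
```

In this toy data, the only frequent pair links a source with gender F to a target with gender M. That pair uses only the gender dimension, so it can be read as one aggregated edge in the view of the network that groups nodes by gender alone.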

Florian Demesmaeker, Amine Ghrab, Siegfried Nijssen, Sabri Skhiri: Discovering interesting patterns in large graph cubes. 2017 IEEE International Conference on Big Data (Big Data), Boston, MA, USA, 2017, pp. 3322-3331.

Click here to access the paper.

Second Workshop on Real-Time and Stream Analytics in Big Data

EURA NOVA is thrilled to share the news with you: we are organizing our second workshop collocated with the 2017 IEEE International Conference on Big Data. The workshop will take place in December in Boston, MA, USA.

 

Stream processing and real-time analytics have caught the interest of the industry lately. Many use cases are waiting for relevant and efficient solutions to be developed. Such use cases include event-driven marketing, dynamic network management & optimization, real-time recommendation, context-aware applications and real-time fraud detection.

 

After the success of the first edition, this is an excellent opportunity to bring together industry and academia to discuss, explore and refine new opportunities and use cases in the area. The workshop will benefit both researchers and practitioners interested in the latest research in real-time and stream processing. The workshop will showcase prototypes and products leveraging big data technologies, as well as models and efficient algorithms for scalable complex event processors and context detection engines, and new architectures leveraging stream processing.

Want to submit a paper? Check out the workshop website to find all the information you will need. Your paper will be reviewed by a prestigious panel of international experts from both the academic and the industrial worlds.

Next Workshop on Graph Business Intelligence

EURA NOVA is organizing its second workshop collocated with an international conference. This time, the workshop will be collocated with the 21st European Conference on Advances in Databases and Information Systems. It will take place in September in Cyprus and will bring together industrial and academic stakeholders to discuss, explore and refine new opportunities and use cases in the area of Graph Business Intelligence.

 

Want to be part of the fun? Check out the workshop website to find all the information you need to know and submit your paper. Our researchers Sabri Skhiri, Salim Jouili and Amine Ghrab cannot wait to read your papers and meet you in Nicosia.

Big Data Architectures at Universitat Politècnica de Catalunya

Today and Wednesday (the 13th and the 15th of March 2017), our R&D Director will be in Barcelona to give a course about Big Data Architectures.

The objective is to learn the basic concepts and the details to take into account when designing a Big Data Architecture. Students will learn the impact of technical and functional constraints on storage and processing choices. Going further, the course will show, through industrial use cases, the rise of new architecture patterns. The course includes a practical part with hands-on sessions on distributed frameworks.

Contents:

  • Terminology & Concepts
  • Distributed architecture
  • Big Data Storage
  • Big Data Processing
  • Big Data Architecture Patterns (Hands-on session)
  • Distributed processing with Apache Flink / Spark
  • Data manipulation with Apache Pig

For more details, contact Oscar Romero (oromero@essi.upc.edu)

Want to host Sabri Skhiri for a course in your university? Contact research@euranova.eu

ENX University in Tunis

On the 9th and 10th of May 2017, the R&D Director of EURA NOVA, Sabri Skhiri, will lecture on Big Data and Data Science at the Polytechnic School of Tunisia. The course will be hosted by the SERCOM laboratory.

After the launch of EURA NOVA Tunis last September, this course will be a new opportunity for us to bond a little more with Tunisians, especially students. Indeed, EURA NOVA offers programmes in collaboration with universities, such as boot camps, master's theses, research internships, PhDs and engineering internships. We hope that this lecture will make Polytechnic students want to explore Data Science with us and join the pack!

 

Want to organise a lecture on Big Data and Data Science in your own university? Contact research@euranova.eu and ask for the ENX University offer.

 

Here is the detailed programme (translated from French):

 

Tuesday 9 May 2017: Big Data Architecture (part 1)

Morning (8:30 AM to 12:30 PM)

  1. Terminology and general concepts
  2. Distributed architecture
  3. Big Data storage: NoSQL, NewSQL, distributed file systems

Lunch break: 12:30 PM to 2 PM

Afternoon: 2 PM to 5 PM

Hands-on session: data preparation with a Pig script

    1. Introduction to Pig
    2. Data preparation exercise

______________________________________________________

 

Wednesday 10 May 2017: Big Data Architecture (part 2)

Morning (8:30 AM to 12:30 PM)

  1. Big Data processing: batch and streaming
  2. Big Data architecture patterns
  3. Architectures adopted in industrial contexts: case study

Lunch break: 12:30 PM to 2 PM

Afternoon: 2 PM to 5 PM

Hands-on session on Apache Spark/Flink

    1. Introduction to Flink and basic Scala commands
    2. Batch and stream data processing

 

 

 

 

EURA NOVA R&D has a new rallying cry: Join The Pack!


 

After launching our first boot camp, we are organising our first workshop collocated with an IEEE conference. The workshop will take place in December in Washington D.C. and will bring together industrial and academic stakeholders to discuss, explore and refine new opportunities and use cases in the area of stream processing and real-time analytics in big data.

Indeed, stream processing and real-time analytics have caught the interest of the industry lately. Many use cases are waiting for relevant and efficient solutions to be developed. Such use cases include event-driven marketing, dynamic network management & optimization, real-time recommendation, context-aware applications and real-time fraud detection.

The workshop will showcase prototypes or products leveraging big data technologies as well as models and efficient algorithms for scalable complex event processors and context detection engines. Here is a short list of research topics to inspire you:

  • New stream processing architecture for big data.
  • Complex event processing for big data, pattern matching engines for big data.
  • Scalable real-time decision algorithms.
  • Scalable stream processing architecture, algorithms or models.
  • Stream SQL and other continuous query languages on big data frameworks.
  • Algorithms for high-speed data stream mining.
  • On-line/incremental learning on data streams.

Your paper will be reviewed by a panel of academic as well as industrial experts.  

Find more information about program co-chairs and members on the workshop website and submit your paper to join the Euranovian pack!

Don’t miss the chance to be part of an IEEE conference and to see Washington under the snow.

 

 

BOOT CAMP 2017

EURA NOVA is launching an intense 3-month I.T. boot camp starting September 2017.

Installing TensorFlow with distributed GPU support.

Today, I wrote my first “Hello World” script using the freshly open-sourced version of TensorFlow with distributed GPU support. At the time of this writing, the binary releases of TensorFlow do not come with distributed GPU support, so I had to build TensorFlow from source. All the documentation needed to do this already exists but is scattered across multiple websites. Here is a condensed version of the install process (on a Linux Ubuntu 14.04 platform).

Continue reading