IntoTheMinds consulting blog
Advice in Data & IT
The UK Data Ethics Framework Explained
Apr02

First published in June 2018, the Government’s Data Ethics Framework is another way of protecting our sensitive information in a data-driven world. The framework was then updated in September 2020 to be more extensive and to create more awareness of these guidelines. Yet despite this, many are still unaware of what this framework actually is and what purpose it serves. That’s why we’ve put together this guide. Below, we’re going to...

Data wrangling: definition, steps, tools [guide 2021]
Mar29

Data wrangling might sound like an intimidating tech term, yet anyone can develop an understanding of the process with a little time and effort. Data wrangling, also known as data munging, is part of the larger process of “data preparation”. It is the process of collecting, selecting, restructuring, enriching and ultimately transforming information with the aim of answering a specific question....

ETL benchmark: how long does it take to process 1 billion lines?
Mar26

In a previous article, I highlighted the importance of processing speed when choosing a data preparation solution (ETL). I ran a first benchmark of Alteryx, Tableau Prep, and Anatella on a file of 108 million lines. This time I repeated the operation on 1.039 billion lines and added Talend to the benchmark. The results are unexpected: processing speeds differ by a factor of up to 20. TEASER: stay tuned. In my next article...

Data preparation: how to choose the right solution for your needs?
Mar22

I regularly use 4 data preparation solutions: Alteryx, Tableau Prep, Talend, and Anatella. The G2 benchmark lists no fewer than 40 of them. This article explains why the G2 benchmark is not necessarily relevant and what criteria to use to compare the ETL (Extract – Transform – Load) solutions best suited to your needs. Summary What is an ETL? The G2 benchmark criteria The most important criteria for choosing an ETL, in my...

Benchmark: which ETL to choose to handle large files?
Mar15

When it comes to data preparation, it is clear that a large part of the processing still takes place offline, with archives extracted from relational databases. And when these archives need to be manipulated by data engineers, it is essential to have an ETL that can handle the load. In today’s article, I propose to tackle the fascinating subject of large files by comparing 2 ETLs that I use daily: Alteryx and Anatella....

ETL: use of ternary operators for data extraction in Excel
Feb12

How to use an ETL solution to extract information from an Excel file and reconcile it? After explaining a method to automate the extraction from many Excel files, in today’s post, I will go into more depth and explain how I used ternary operators to extract the desired information. Presentation of the problem The problem I have faced is relatively standard. You have a poorly structured file that has been filled in by a user or...

How to automate the extraction of data from Excel files
Feb01

This is the problem that I have been facing recently. A client asked me to process data from hundreds of Excel files. Frankly, I didn’t want to spend hours extracting this data manually. So, I automated the process with Anatella and visualized the data using Tableau afterward. Here’s a little tutorial on data preparation and an excellent way to process your data more efficiently with an ETL worthy of the name. If you only...

Data preparation: definition, examples, advice [guide 2021]
Jan18

To talk about data preparation, what better way to start than with an observation? In the world of data, there is a rule that everyone knows: 80% of a data scientist’s time is spent preparing data, and only 20% working on it, especially its visualization. This article will focus on data preparation: the most frequently encountered problems, tools, and trends. Let’s go for new data adventures! Summary Introduction Data...

Data visualization: definition, examples, tools, advice [guide 2021]
Dec09

Data Visualization, or DataViz, is the discipline that focuses on the graphical representation of raw data. The purpose of data visualization is to create value. The last few years have seen the emergence of new tools and practices around data visualization, as well as new professions. This article will start by giving you the definition of data visualization and illustrate with a concrete example of the 5 levels of data...

[Podcast] Teaching children to use digital technology
Nov23

Nathalie Kuborn (We Are Coders), Gaëlle Girardeau (Cood) and Christophe Coquis (Geek Junior) talk to us about learning about computers, digital technology and the introduction to digital culture for children: the market, courses and advice. The market for teaching digital technology to children The emergence of a need The market for computer-based learning for children is emerging. For Gaëlle, this can be explained by the fact that we...

Data science explained by a child
Nov20

The term “data science” remains opaque to the average person. Yet these two words are often used in the press as one of the most trendy disciplines of the moment. To demystify the terms used in the field of data, we have launched a series of videos hosted by our star presenter: Pierre-Raffaele (10 years old at the time of shooting) 🙂 After the success of his series on “The Business World”, Pierre-Raffaele is...

[Podcast] The essence of Data Art and Data Visualisation
Oct09

Nicholas Rougeux defines himself as a “data artist”, and his talent for visualising data is obvious. So obvious that he markets his work in the form of posters and I even got one for my birthday 🙂 Run to his blog (www.c82.net) if you lack inspiration or if you want to see the best in DataViz. In this podcast, I talk with Nicholas about his creative process, his sources of inspiration, and of course, the technical aspects...

Tutorial: How to create a Joyplot using Tableau
Oct02

After having explained to you how to create an isometric hexmap, I now tackle a more difficult subject: creating a joyplot using Tableau. I used a joyplot in the visualisation I submitted to the IronViz. As this type of graph is relatively rare, and I’ve had some trouble getting it right, I thought a tutorial would be welcome. Summary Definition of joyplot Origin of the name “Joyplot” Tutorial on how to create...

Covid-19: impact on the IT consulting market and outlook
Sep25

The Covid-19 crisis has had a massive impact on the IT market. Some parts of this sector have benefited (think of everything that revolves around remote working), but others have also suffered enormously. This is the case of IT consultancy. According to a Whoz study published in June 2020, 13% of projects had been abandoned, and 21% had been put “on hold”. Jean-Philippe Couturier, the CEO of Whoz, discusses in this podcast...

Rethinking the customer experience in museums with (Big) data
Sep09

How can Big Data help museums reinvent themselves? How can data collection techniques be used to understand visitors better? How can data be used to build visitor loyalty? In this article, we propose a series of reflections on current and future museographic devices and on “data” initiatives that can be put in place to improve the customer experience and increase visitor loyalty. Summary Introduction Museums in the Age of...

Isometric Hexmap using Tableau: a step-by-step guide
Aug10

As part of my participation in the Iron Viz 2020, I created a new type of visualisation using Tableau, which has received a lot of positive feedback. I called it ‘isometric hexmap’. In this article, I explain how to reproduce this type of representation and give you access to the source files. You can access the interactive version on Tableau Public. If you like this visualisation, feel free to leave a comment and to use...

Tableau Iron Viz 2020: thoughts on my first participation
Aug04

The Iron Viz is the “world championship” that crowns the 3 leading developers using Tableau, a data visualisation software package. This year I decided to participate too (access my contribution here). In this article, I look back on my experience, what I learned from preparing for the competition and the difficulties I encountered. 6 tips for a good #Dataviz Spend time choosing a rich, quality primary data...

Migration to Europe: interactive visualisation using Tableau
Jul13

How can migration to EU countries be represented attractively? This is the question I wanted to answer using Tableau Software. If the question was simple, the answer was not so simple, and it took me a few days of deliberation to find the right solution and the right design. Below I present it to you with the modus operandi to reproduce it.

What do you have to talk about on LinkedIn to get likes and comments?
Jul06

After analysing the best hours to post on LinkedIn, I tackled a much more difficult problem: determining which topics generated the most views and engagement on LinkedIn. In today’s post, I will show you the results I was able to achieve using 2 specific tools: Anatella for data extraction and preparation on the one hand, and Tableau for exploring and visualising the effects on the other. If you are interested in this type of...

What is the best time to post on LinkedIn?
Jun29

In their attempt to understand how the LinkedIn algorithm works, users of this social network often debate the ideal time to put their content online. I decided to do some analysis to separate truth from myth in this story. If you are interested in this kind of analysis and want to do the same with your data, subscribe to our newsletter (below). When the procedure and tools are ready to be released to carry out these...

Fuzzy Matching with Alteryx: tests, results and comparison
Jun26

Fuzzy matching is a handy technique in many everyday situations. Alteryx offers a Fuzzy Matching feature that I have compared to Anatella’s function. I had already talked about Fuzzy Matching in this post. Since the Fuzzy Matching features of Tableau Prep Builder were not convincing, I wanted to see what Alteryx had to offer. In the end, Anatella gives better results, and the programming of the ETL process is much more efficient....

Fuzzy matching: comparison of 4 methods for making a join
Jun19

In a previous article, I shared with you a solution for performing fuzzy matching between 2 different tables. I had then compared 2 ETL (Extract Transform Load) solutions. Tableau Prep Builder did not achieve the desired result. So, I turned to Anatella. In today’s article, I explore the different Fuzzy Matching algorithms available in this tool and their effects. As you will see, one algorithm emerges as the winner of the...

LinkedIn algorithm: 1 reaction will get you 83 views
Jun15

How does the LinkedIn algorithm work? What is the correlation between the number of likes or comments and the number of views on LinkedIn? Everybody’s trying to understand LinkedIn’s algorithm. Some are even trying to hack it. While theories abound, no one knows how it works. I’ve rolled up my sleeves, and today I’m going to explain it to you, with a statistical model to back it up. BONUS: receive your...

Fuzzy matching between tables: 2 ETL compared (Tableau Prep Builder vs Anatella)
Jun12

If you are manipulating data for analysis or visualisation purposes, you may have encountered this problem before. You need to create a join between 2 databases, but the entries in the reference field are not the same. Differences in spelling, different terminologies, … the reasons are many and varied. During the research that I’m doing to visualise migration flows, I was confronted with this problem. I had to solve it by...

New LinkedIn algorithm: here’s how to stay visible
May27

LinkedIn’s engineers published the recipe for their new algorithm on May 12, 2020. The least we can say is that it completely changes the practice of making a post go viral. If you want your LinkedIn posts to be seen (and incidentally become popular), read the following carefully.

Connected insurance: reliability of connected wristbands and hacking
Apr15

In today’s article, we will talk about the reliability of connected Fitbit wristbands and how they are being hacked. The phenomenon is still not very widespread in Europe, but in the United States, it has become a real problem. It is the owners of these connected bracelets who try to hack them. Why are they trying to hack them? We explain everything in this article. We take this opportunity to give you an analysis of the...

How to hack the recommendation algorithms?
Feb12

Hacking a recommendation algorithm is an operation that can be very profitable. Indeed, some recommendation algorithms are ubiquitous in our digital life and strongly influence our behaviour. We have become dependent on algorithms without realising it. For example, 35% of purchases on Amazon are the direct consequence of an algorithmic recommendation. An algorithm suggests 80% of movies watched on Netflix and 75% of videos on...

Simon Weckert, a German artist, has succeeded in hacking Google Maps
Feb07

Hack an algorithm? Not necessarily as complicated as it sounds, and the economic consequences can be significant. Indeed, everything today is governed by algorithms, especially recommendation algorithms. Hijacking a recommendation algorithm can, therefore, have considerable financial repercussions. In today’s article, we would like to present the project of Simon Weckert, a German artist who “hacked” the Google Maps...

2025: the end of personalisation algorithms?
Jan13

At a time when Amazon already achieves 35% of its turnover thanks to algorithmic recommendations and is launching two new personalisation tools (Discover and Showroom), Gartner announces the end of personalisation algorithms by 2025. This ambiguous announcement seems to deserve a more in-depth analysis. Contents Infographics on personalisation algorithms The purpose of algorithmic personalisation Why abandon personalisation? Problem...

[Podcast] Promises and challenges of artificial intelligence with Ségolène Martin
Dec09

In this week’s Podcast, we welcome Ségolène Martin, the CEO of Kantify, a company specialising in artificial intelligence, based in Brussels. Artificial intelligence has undoubtedly been one of the most popular subjects in recent months, but in this Podcast, we will try to go beyond the clichés. We will, therefore, focus on the real added value of artificial intelligence in companies, but also on the difficulties that the launch...

How to achieve better marketing and market research with Big Data
Nov27

Technological progress today allows us to enjoy excellent living conditions. Control of space, control of reproduction, control of information are challenges that we have already met. However, control of time remains out of reach. Many human obsessions revolve around the power of this resource, offered to us at birth in a desperately finite quantity: time. Staying young, living old, perpetuating your memory, predicting the...

Personalising customer relations: how to gain trust and data?
Nov25

Recommendation algorithms promise marketers their Holy Grail: personalised relationships with all their customers at a reasonable price. Since the quality of customer relations is a decisive factor for customer satisfaction and loyalty, algorithmic recommendations represent the promise of quality interactions with customers and meaningful marketing actions. However, large-scale customisation can only...

How to get Internet users to share more personal data?
Nov13

What personal data are Internet users willing to share in exchange for better personalisation of the marketing offer? How can you adapt your marketing strategy to get more data? This question is the subject of much debate in the era of the GDPR and on the eve of the entry into force of e-privacy. What data can a company still collect without fear that its customers will lose confidence? It is this tricky but crucial question that we propose...

Which regions of France have the most enterprising approach?
Oct18

In our last post, we presented you with a map representing the French counties where the most (and least) companies are created. This analysis, as impressive as it may be, needed to be further developed to take into account the disparities that exist in terms of population density. A region with a tiny population will necessarily create fewer companies than a large city (let alone a capital). It was, therefore, necessary to take into...

Firms’ creations in France: an interactive visualization
Oct02

The National Institute of Statistics (see our article on the search for statistical data for your market research) makes quality data available to all, but it is still necessary to be able to use them efficiently. The data are often in their original form and are therefore not easily accessible to newbies. For example, INSEE (the French national statistics institute) produces excellent data on the creation of businesses in France, but it...

Media research: 4 data journalism projects worth knowing
Mar18

As part of the Big Data Initiative (which I’m chairing) of the European Broadcasting Union, a conference was organized on 26 and 27 February 2019 in Munich to discuss the latest members’ projects in the field of data journalism. Although more than a dozen presentations related to this topic were given, I chose to report and compare in this article only 4 of them that are extremely concrete and were each launched in a different...
