This is Flux Capacitor, the company weblog of Fluxicon.
You can find more articles here.

You should follow us on Twitter here.

Data Requirements FAQ: How to Extract Data for Process Mining? 5

Finding the right data for process mining.

In our last post, I was talking about the process-oriented mental model that underlies process mining to explain what kind of data are needed. In the coming posts, I will be covering a number of more practical questions that come up regularly.

Here is the first one.

FAQ #1: How easy is it to extract data?

The honest answer is “It depends”. It depends on the domain and the source systems you are extracting the data from.

What you need to look for

In most situations it is advisable to work with the IT staff of your organization. They will extract the data for you. It is your task to tell them what kind of data you need. For that, you need to be able to identify the three elements described in the previous post:

Most of the time, it is easy to find the activities and timestamp information. As for the case ID, that depends. For example, in any customer service system, or in IT services, it is easy to find some kind of ticket number that can be used as a case ID. Also in hospital information systems, patient ID numbers are readily available to differentiate the diagnosis and treatment processes for different patients.

In other situations it can be more tricky: For example, for complicated end-to-end processes in ERP systems such as the purchase-to-pay process one may need to connect purchase order numbers with the corresponding invoice numbers to get the complete picture.

Start simple

As always, you need to manage the trade-off between effort (to extract and analyze the data) and benefit (to understand and improve the underlying business process).

Overall, my experience is that if the business is determined to use process mining, getting the data is not an issue at all.1 Typical drivers are that they want to understand and improve their processes, either because they have the perception that something is broken, or because they need greater transparency of what is going on to be able to react faster and become more pro-active.

What is your experience? How easy was it to get the data you needed for your process mining project?

  1. Get in touch with us if you plan to use process mining in your organization and need advice for the data extraction phase. 

Comments (5)

[…] Process Mining – Anne Rozinat Overall, my experience is that if the business is determined to use process […]

I have not had any problem to get the data for my project:-)

That’s good to hear. Thanks for the feedback, Peter!

i am trying to do a project on process mining,how do i extract data for the to convert it to .xes format

Hi Benjamin,

Great to hear that you are looking into process mining.

Note that you don’t necessarily need to create .xes formatted files to do process mining. For example, in Disco (see you can simply import CSV files.

Disco can also export .xes files if you do need to to import your event log into ProM. You can find more information in the Disco user guide here:

I hope this helps,

Leave a reply