PDC 2021 Data Set Disclosed

The data set of the Process Discovery Contest of 2021 (PDC 2021) has now been disclosed. It contains the following four folders (in ZIP archives):

  • Ground Truth Logs: The 96 test logs (.xes format) as classified by the corresponding models.
  • Models: The 96 original workflow nets (.pnml format) used to generate the logs.
  • Test Logs: The 96 logs (.xes format) to be classified using the models that the submitted algorithm discovered from the training logs.
  • Training Logs: The 480 logs (.xes format) from which the submitted algorithm discovers the models.
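
As a quick sanity check after unpacking the archives, the logs can be inspected with a few lines of Python. The following is only a minimal sketch: it assumes that an .xes file is XML with `trace` elements containing `event` elements (as in the XES standard), it ignores any XML namespace, and the function name `count_traces_and_events` and the inline sample log are made up for illustration and are not part of the data set.

```python
import xml.etree.ElementTree as ET
from io import BytesIO


def local_name(tag):
    """Strip an optional '{namespace}' prefix from an XML tag name."""
    return tag.rsplit("}", 1)[-1]


def count_traces_and_events(source):
    """Count <trace> and <event> elements in an XES log.

    `source` may be a file path or a binary file object. The file is
    parsed incrementally so that large logs need not fit in memory.
    """
    traces = events = 0
    for _, elem in ET.iterparse(source, events=("end",)):
        name = local_name(elem.tag)
        if name == "event":
            events += 1
        elif name == "trace":
            traces += 1
            elem.clear()  # drop the finished trace's children to save memory
    return traces, events


# Hypothetical two-trace log, used only to demonstrate the function.
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8"?>
<log xes.version="1.0" xmlns="http://www.xes-standard.org/">
  <trace>
    <string key="concept:name" value="case_1"/>
    <event><string key="concept:name" value="a"/></event>
    <event><string key="concept:name" value="b"/></event>
  </trace>
  <trace>
    <string key="concept:name" value="case_2"/>
    <event><string key="concept:name" value="a"/></event>
  </trace>
</log>"""

print(count_traces_and_events(BytesIO(SAMPLE)))  # (2, 3)
```

To apply this to the data set, pass the path of any extracted .xes file instead of the in-memory sample.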
