# Differences

This shows you the differences between two versions of the page.

Both sides previous revision Previous revision | |||

assignments:advanced_process_mining_techniques_in_practice_several_master_projects_with_processgold [2017/11/27 15:08] dfahland |
assignments:advanced_process_mining_techniques_in_practice_several_master_projects_with_processgold [2018/01/15 20:41] (current) dfahland [Possible Graduation Project Topics] |
||
---|---|---|---|

Line 8: | Line 8: | ||

===== Possible Graduation Project Topics ===== | ===== Possible Graduation Project Topics ===== | ||

- | * **Visually compare BPMN models created by modelers with mined models discovered from event data.** | + | * **Enhancing the Inductive Miner.** The inductive miner is a process mining algorithm that can inductively discover structures, such as choice or parallelism, from event logs. We would like to investigate how we can integrate the inductive miner into ProcessGold, and where, if possible, we can improve on the algorithm to suit our practical needs. |

- | *** Investigate how specific process improvement methodologies - such as Six Sigma - can be supported by Process Mining**, and design an application that supports this methodology. | + | * **Conformance checking using BPMN.** Business Process Model and Notation (BPMN) can be used to model desired or expected process behavior. We would like to investigate how we can import BPMN models into ProcessGold and how to integrate conformance checking into our platform using BPMN models. |

- | * **Investigate parallelism in mined process models.** Process models with parallelism are harder to interpret for users with a non-technical background. Investigate how a-priori knowledge about parallelism in a process can be leveraged to aid users in analyzing these processes. | + | * **Prediction/simulation of throughput times.** We would like to investigate how we can predict the throughput times in a process based on its mined model using predictors or simulation. |

- | * **Develop a proof of concept for process mining for processes distributed over multiple geographical locations** by making use of geographic maps. | + | * **Process mining with user access rights.** In some organizations, not all analysts may be allowed to see all details of their organization’s process or in some cases even some parts of the process. We would like to investigate how we can take these access rights into account while still providing the user with a process that can lead to meaningful insights. |

- | * **Conduct an extensive study on the field of process mining research and evaluate which current research trends and results would be practically applicable** in ProcessGold's process improvement platform. | + | * **Multi-instanced processes.** Many processes have hierarchical cases that split up into multiple sub-cases that merge again later. For example, the process of manufacturing a car. Typically, these sub-cases are independent of each other, which leads to parallelism in traditional process mining. We would like to investigate how to generate insights in these kinds of processes. We will look at a pathology use case where the cases consist of a hierarchy of sub-processes. |

- | * **Automate or improve user-support for event log extraction** from a wide variety of source systems. | + | * **Social analysis.** Social interaction is an important factor within processes where people are working together. A common approach to get insights is a social network. We would like to investigate other approaches to analyze the social interactions that happen within a process. |

- | * **Investigate process mining for event data recorded at different conceptual levels.** Design an application showing how interaction through multiple levels of the process can make easier analysis possible. | + | * **Process flows on maps.** Some processes can be expressed as a flow, where goods, such as packages, cargo, or money, or physical objects, such as cars or vessels, flow between predefined geographic locations. We would like to investigate how we can interactively visualize these flows on a map to enable our users to explore, understand, and find anomalies in them. |

+ | * **Interactive grouping of processes.** Process data often consists of multiple sub-processes or groups of cases that exhibit similar behavior. Displaying and analyzing all these cases as a single process model may be difficult and confusing. Therefore, we would like to investigate how we can let the user interactively separate the cases of these processes into meaningful groups that can be explored separately. | ||

+ | * **Distributed calculation of expressions.** The ProcessGold platform owes much of its flexibility to an internal expression language. This expression language is used to compute a wide variety of expressions on a very large number of records. Currently, these expressions are computed on a single thread and may form a bottleneck. We would like to investigate how we can distribute the calculation of these expressions over multiple threads to increase overall performance. | ||

+ | * **Performance.** As the scale of the data of our customers grows, the shear amount of calculations that need to be performed on the data grows as well. To increase overall performance, we would like to investigate strategies such as flattening the data model, resorting data, and decreasing indirections. | ||

+ | * **Evaluation of moving calculations to a SQL or MapReduce backend.** As the scale of the data of our customers grows, the shear amount of calculations that need to be performed on the data grows as well. As a strategy to increase performance, we would like to investigate the possibilities and impact of offloading these calculations to a SQL or MapReduce backend. | ||

In all projects, the intern should be able work out the problem definition (in collaboration with ProcessGold and the supervisor), come up with a conceptual solution, and where applicable realize the solution in a proof-of-concept (in collaboration with ProcessGold). | In all projects, the intern should be able work out the problem definition (in collaboration with ProcessGold and the supervisor), come up with a conceptual solution, and where applicable realize the solution in a proof-of-concept (in collaboration with ProcessGold). |