Which clustering algorithm is able to cluster on multiset properties of cases?

kvm
edited May 2014 in ProM 6

Dear ProM users,
I am looking for a clustering algorithm that works on multisets of activities. The order of activities is not important in my case, and the algorithms I have read about so far cluster at least partly on the sequence of activities. Which algorithm would be suited to grouping cases based on their multiset properties (http://en.wikipedia.org/wiki/Multiset), that is, comparing which activities are performed and how many times each activity is performed? Does anyone have an idea?
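
To make the requirement concrete, here is a minimal Python sketch of order-insensitive clustering. It is not a ProM plugin and not one of the published trace clustering algorithms. The function names, the toy log, and the choice of single-linkage grouping with a distance threshold are all assumptions made for illustration. Each case is reduced to a multiset of activities, and two cases are compared by the size of the symmetric difference of their multisets.

```python
from collections import Counter
from itertools import combinations


def activity_multiset(trace):
    """Reduce a trace to its multiset of activities (order is discarded)."""
    return Counter(trace)


def multiset_distance(a, b):
    """Size of the symmetric multiset difference (Manhattan distance on counts)."""
    return sum(abs(a[k] - b[k]) for k in a.keys() | b.keys())


def cluster_cases(log, max_distance):
    """Single-linkage grouping (hypothetical helper): cases are linked when
    their multisets differ by at most max_distance, and linked cases end up
    in the same cluster."""
    bags = {case: activity_multiset(trace) for case, trace in log.items()}
    parent = {case: case for case in log}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for c1, c2 in combinations(log, 2):
        if multiset_distance(bags[c1], bags[c2]) <= max_distance:
            parent[find(c1)] = find(c2)

    clusters = {}
    for case in log:
        clusters.setdefault(find(case), []).append(case)
    return sorted(sorted(members) for members in clusters.values())


# Toy event log (made up): case id -> sequence of activities.
log = {
    "case1": ["a", "b", "c", "b"],
    "case2": ["b", "a", "b", "c"],  # same multiset as case1, different order
    "case3": ["a", "b", "c"],       # one "b" fewer than case1
    "case4": ["d", "e", "d", "d"],
}

print(cluster_cases(log, max_distance=1))
# [['case1', 'case2', 'case3'], ['case4']]
```

With `max_distance=0`, only cases with identical multisets are grouped, so `case3` would form its own cluster. Any distance-based clustering method could be substituted for the threshold grouping, as long as it operates on these activity counts rather than on the sequences.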

Best Answer

  • JBuijs
    Accepted Answer

    You might want to look into the work of J.C. Bose; he proposed (and, I assume, implemented) many such trace clustering algorithms.

    Joos Buijs

    Senior Data Scientist and process mining expert at APG (Dutch pension fund executor).
    Previously Assistant Professor in Process Mining at Eindhoven University of Technology