Sequential patterns mining algorithm based on improved PrefixSpan

PrefixSpan,the classic sequential patterns mining algorithm,has the problem of producing huge amount of project databases.To solve this problem,a sequential patterns mining algorithm named SPMIP was proposed based on an improved PrefixSpan.This algorithm reduced the scale of projected databases and the time of scanning projected databases through adding pruning step and reducing scanning of certain specific sequential patterns production.In this way,algorithm efficiency could be raised up,and the needed sequential patterns were obtained.The experimental results show that SPMIP is more efficient than PrefixSpan while obtained sequential patterns have not been affected.