PoS - Proceedings of Science
Volume 299 - The 7th International Conference on Computer Engineering and Networks (CENet2017) - Session III - Information Theory
Frequent sequence mining from massive access log for user’s behaviour investigation
W. Chen*, Y. Tong, J. Zhang and T. Qin
Pre-published on: July 17, 2017
Published on: September 06, 2017
Abstract
With the rapid development of Web 2.0, users can obtain almost anything they want from the Web, and their access behaviour is recorded in access logs. By mining frequent access sequences from these logs, we can gain a deeper understanding of users' access interests, which in turn can improve the efficiency of network management. In this paper, we first present methods for log pre-processing and feature extraction. Second, we employ the PrefixSpan algorithm to mine frequent sequences. To process the massive log data produced in today's networks, we also combine the proposed methods with Spark. Finally, experimental results based on log data collected from the campus network of Xi’an Jiaotong University verify the efficiency of the developed methods, which are useful for understanding and managing user behaviour.
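As a rough illustration of the mining step named in the abstract, the following is a minimal single-machine sketch of PrefixSpan in Python. It is not the authors' implementation: the function name `prefixspan`, the session data, and the category labels are all hypothetical, each sequence element is assumed to be a single item rather than an itemset, and the Spark integration described in the abstract is not reproduced here.

```python
from collections import defaultdict


def prefixspan(sequences, min_support):
    """Return {pattern: support} for every sequential pattern that occurs
    (as a subsequence) in at least `min_support` of the input sequences.

    Assumption: each sequence is a list of single items, not itemsets.
    """
    results = {}

    def mine(prefix, projected):
        # `projected` holds (sequence index, start offset) pairs: the suffix
        # of each sequence that follows the first occurrence of `prefix`.
        counts = defaultdict(int)
        for seq_idx, start in projected:
            # Count each item at most once per sequence.
            for item in set(sequences[seq_idx][start:]):
                counts[item] += 1

        for item, support in counts.items():
            if support < min_support:
                continue
            new_prefix = prefix + (item,)
            results[new_prefix] = support

            # Build the projected database for the extended prefix.
            new_projected = []
            for seq_idx, start in projected:
                try:
                    pos = sequences[seq_idx].index(item, start)
                except ValueError:
                    continue  # item does not occur in this suffix
                new_projected.append((seq_idx, pos + 1))
            mine(new_prefix, new_projected)

    mine((), [(i, 0) for i in range(len(sequences))])
    return results


# Hypothetical per-user sessions after pre-processing: each entry is the
# ordered list of site categories one user visited.
sessions = [
    ["news", "mail", "video"],
    ["news", "video"],
    ["mail", "news", "video"],
    ["news", "mail"],
]

patterns = prefixspan(sessions, min_support=3)
for pattern, support in sorted(patterns.items()):
    print(" -> ".join(pattern), support)
```

With a minimum support of 3, the only multi-item pattern that survives in this toy data is news followed by video. For large logs, Spark MLlib ships a distributed PrefixSpan that plays the same role as this sketch.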
DOI: https://doi.org/10.22323/1.299.0061
Open Access
Copyright owned by the author(s) under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.