MAWI Working Group Traffic Archive
Packet traces from WIDE backbone
This is a traffic data repository maintained by
the MAWI Working Group of
the WIDE Project.
Currently, traffic traces are collected at the following sampling points:
- samplepoint-F
- daily trace at another trans-Pacific line (150Mbps link) in
operation since 2006/07/01:
2006,
2007,
2008,
2009,
2010,
2011,
2012,
2013
48-hour-long traces on 2007/01/09-11,
72-hour-long traces on 2008/03/18-20,
96-hour-long traces on 2009/03/30-04/02,
83-hour-long traces on 2010/04/13-16,
and 63-hour-long traces on 2012/03/30-04/01,
as part of
a Day in the Life of the Internet project.
The link was upgraded from 100Mbps to 150Mbps on June 1 2007.
Older traces:
- samplepoint-A (terminated in Nov 2000)
- daily trace of a trans-Pacific T1 line (one of them):
1999,
2000
Y2K roll-over
- samplepoint-B
- daily trace of another trans-Pacific line (18Mbps CAR on 100Mbps link)
This link was terminated on 2006/07/01, and replaced by Samplepoint-F:
2000,
2001,
2002,
2003,
2004,
2005,
2006
107-hour-long traces
from another US-Japan link taken from 1999/05/10 10pm to 1999/05/15 7am.
The link was upgraded from 4Mbps to 10Mbps on 5/13 around 15:30.
24-hour-long traces on 2003/02/27,
2005/01/07,
2005/09/22,
and
2006/03/03.
plot graphs by aguri:
port numbers:
2001,
2002,
2003,
2004,
2005,
2006
(wide-members only: addresses:
2001,
2002,
2003,
2004,
2005,
2006)
- samplepoint-C
- daily trace at an IPv6 line connected to 6Bone:
2000,
2001,
2002,
2003,
2004,
2005,
2006,
2007,
2008,
2009
- samplepoint-D
- daily trace at an IPv6 line connected to WIDE-6Bone:
2000,
2001,
2002,
2003,
2004,
2005,
2006,
2007,
2008,
2009
- samplepoint-E
- yet another US-Japan line (OC-3) which came into operation in Dec 2001:
plot graphs by aguri:
port numbers:
2002,
2003,
2004,
2005,
2006,
2007,
2008,
2009
(wide-members only: addresses:
2002,
2003,
2004,
2005,
2006,
2007,
2008,
2009)
Traffic traces are made by
tcpdump, and then, IP addresses in the traces are scrambled by a modified
version of
tcpdpriv.
The
soure code of a set of tools used to create our archive is also available.
MAWILab
is a database that assists researchers to evaluate their traffic
anomaly detection methods. It consists of a set of labels locating
traffic anomalies in the MAWI archive (samplepoints B and F). The
labels are obtained using an advanced graph-based methodology that
compares and combines different and independent anomaly detectors. The
data set is daily updated to include new traffic from upcoming
applications and anomalies.
You may use WIDE traffic data for only research purposes.
Actions that trespass upon users' privacy are prohibited.
Here is guidelines for protecting user privacy
in WIDE traffic archive. (japanese version).
Related Papers
- R. Fontugne, P. Borgnat, P. Abry, K. Fukuda.
"MAWILab: Combining diverse anomaly detectors for
automated anomaly labeling and performance benchmarking".
ACM CoNEXT 2010. Philadelphia, PA. December 2010.
- Pierre Borgnat, Guillaume Dewaele, Kensuke Fukuda, Patrice Abry, Kenjiro Cho.
"Seven Years and One Day: Sketching the Evolution of Internet Traffic."
INFOCOM2009. Rio de Janeiro, Brazil. April 2009.
- Guilaume Dewaele, Kensuke Fukuda, Pierre Borgnat, Patrice Abry,
Kenjiro Cho.
"Extracting Hidden Anomalies using Sketch and Non Gaussian Multiresolution Statistical Detection Procedures"
SIGCOMM2007 LSAD Workshop, Kyoto Japan. August 2007..
- Kenjiro Cho, Ryo Kaizaki and Akira Kato.
"Aguri: An Aggregation-based Traffic Profiler"
In Proceedings of QofIS2001 (published by Springer-Verlag in
the LCNS
series). September 2001.
- Kenjiro Cho, Koushirou Mitsuya and Akira Kato.
"Traffic Data Repository at the WIDE Project"
USENIX 2000 FREENIX Track,
San Diego, CA, June 2000.
(HTML version)
- Akira KATO, Jun MURAI, Satoshi KATSUNO and Tohru ASAMI.
"An Internet Traffic Data Repository: The Architecture and the Design Policy"
INET99, San Jose, CA, June 1999.
Links