Giter Site home page Giter Site logo

Questions on trace data about cacheus HOT 12 CLOSED

sylab avatar sylab commented on August 18, 2024
Questions on trace data

from cacheus.

Comments (12)

zy1024cs avatar zy1024cs commented on August 18, 2024 2

Hi, I'm running the experiment as well but find it hard to reproduce the access pattern in this figure. Screenshot from 2022-05-24 17-59-39

As far as I can see, the downloaded trace files are in this format: [ts in ns] [pid] [process] [lba] [size in 512 Bytes blocks] [Write or Read] [major device number] [minor device number] [MD5 per 4096 Bytes] Some sample data from webmail day 16 (webmail.cs.fiu.edu-110108-113008.16.blkparse) would like 1299602115063356 318 kjournald 220040 8 W 2 0 d186315aa2d4c75b753fe9ca98187cdd

To draw the same figure, we have the timestamp (first column) and the logic block address (fourth column). However, if we draw a scatter plot over the time and block address, the resulting figure would look like this: access patern Very likely I misunderstood the dataset and drew it in the wrong way. It'd be very kind of you if you could explain a bit how to generate that figure. Thank you very much!

You should narrow the range of the coordinate axis, otherwise you often see a line. Because if you want to have that access mode, you can usually observe it in a small range.

from cacheus.

lia54 avatar lia54 commented on August 18, 2024 2

Hi @Wkkkkk thanks for the interest.
For the purpose of building the access pattern, we mapped each lba value of the trace to a unique number to reduce the range on the y-axis of the plot. That's probably why you're seeing a different graph, since you're directly plotting the lba from the trace. The x-axis is the timestamp converted from nanoseconds to hours.

from cacheus.

Wkkkkk avatar Wkkkkk commented on August 18, 2024 1

Hi,
I'm running the experiment as well but find it hard to reproduce the access pattern in this figure.
Screenshot from 2022-05-24 17-59-39

As far as I can see, the downloaded trace files are in this format:
[ts in ns] [pid] [process] [lba] [size in 512 Bytes blocks] [Write or Read] [major device number] [minor device number] [MD5 per 4096 Bytes]
Some sample data from webmail day 16 (webmail.cs.fiu.edu-110108-113008.16.blkparse) would like
1299602115063356 318 kjournald 220040 8 W 2 0 d186315aa2d4c75b753fe9ca98187cdd

To draw the same figure, we have the timestamp (first column) and the logic block address (fourth column). However, if we draw a scatter plot over the time and block address, the resulting figure would look like this:
access patern
Very likely I misunderstood the dataset and drew it in the wrong way. It'd be very kind of you if you could explain a bit how to generate that figure. Thank you very much!

from cacheus.

lia54 avatar lia54 commented on August 18, 2024

Hi @brucechin,
Thank you for your interest in our paper!

Please find the datasets and the corresponding links/locations (when possible):

  1. FIU SRC_Map (All traces are a one-day duration for each file).
  2. MSR Cambridge (These traces are a one-week duration for each file. For the paper, we extracted the first day only based on the timestamp).
  3. CloudVPS (These traces are a one-day duration for each file).
  4. CloudCache is a collection of the webserver and moodle traces that were used in the CloudCache paper from FAST'16. (All one-day duration).
  5. CloudPhysics are non-public traces used in the SHARDS paper from FAST'15 that were shared directly from the authors.

from cacheus.

Wkkkkk avatar Wkkkkk commented on August 18, 2024

Hi,
Thank you both for the quick reply! Sure I'd like to try the mapping to narrow down the range in the y-axis. Would it be possible for you to share a bit about how you did the mapping?
Your paper is quite interesting and your insights about the access pattern mean a lot!

from cacheus.

Wkkkkk avatar Wkkkkk commented on August 18, 2024

The latest plot if we just draw the ones blow 140K.
Screenshot from 2022-05-25 09-25-58

from cacheus.

zy1024cs avatar zy1024cs commented on August 18, 2024

The file types of CloudVPS seem to be the ".blktrace.1". Please ask me how I should convert them, after all the contents seem impossible to handle
QQ截图20220526145142.
includi
QQ截图20220526151541
ngCloudCache

from cacheus.

jzx-bitdb avatar jzx-bitdb commented on August 18, 2024

Hi, Thank you both for the quick reply! Sure I'd like to try the mapping to narrow down the range in the y-axis. Would it be possible for you to share a bit about how you did the mapping? Your paper is quite interesting and your insights about the access pattern mean a lot!

I have the same question about how to range in the y-aixs. Have you solved this problem?

from cacheus.

jzx-bitdb avatar jzx-bitdb commented on August 18, 2024

Hi @Wkkkkk thanks for the interest. For the purpose of building the access pattern, we mapped each lba value of the trace to a unique number to reduce the range on the y-axis of the plot. That's probably why you're seeing a different graph, since you're directly plotting the lba from the trace. The x-axis is the timestamp converted from nanoseconds to hours.

Could you tell me how to map each lba value of the trace. I have two ideas about it. The first one is scale each lba value by the same factor. The second is iterator the request sequence to map them to an increment value. I have the similar picture as yours by the second method. Am I right? And what is the difference between the two methods?

test

from cacheus.

Wkkkkk avatar Wkkkkk commented on August 18, 2024

Hi @xuangestallone,
Your results seem to be a great example. Would you mind sharing the second method in more detail or the code so I could repeat and try it myself as well? I want to keep experimenting and discussing it with you if possible.

My email is [email protected].

from cacheus.

zwh272581638 avatar zwh272581638 commented on August 18, 2024

The file types of CloudVPS seem to be the ".blktrace.1". Please ask me how I should convert them, after all the contents seem impossible to handle QQ截图20220526145142. includi QQ截图20220526151541 ngCloudCache

Perhaps you should read the user manual on the fio and blktrace official website

from cacheus.

sylab avatar sylab commented on August 18, 2024

Closing due to inactivity.

from cacheus.

Related Issues (10)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.