What are some of the commonly used techniques to analyze tick data?
I am looking at tick data to see how the quotes/ mid-price evolves due to certain events in the market. Since tick data is asynchronous one can't really apply traditional time series models to explain these price movements. Some people have proposed that I create price bars based on either clock-time or trade-time but I think that tends to miss out on information happening in between the bars.

4 Answers
4

Your question is very vague (e.g. what are you trying to measure, and what "tick data" do you have), but I'll give you some pointers:

In general, when people consider how prices evolve, they will tend to think about things like volatility and correlation dynamics. So I would start by defining exactly what you want to measure. The irregularity of time series data is not a problem in itself, except in so far as you are making assumptions in your calculations about things like dispersion in time. The amount of variation over 1 millisecond will generally be different than over 1 second (and will also vary by asset), so you need to arrange your statistics to account for this.

1.1. There is a vast literature on measuring volatility using high-frequency tick data. Search for papers on realized variance, volatility, and correlation from people like Neil Shepard (see his institute) or Tim Bollerslev. One feature of this literature is that it is actually optimal to not use tick-by-tick data because of what is known as microstructure noise (e.g. bid-ask bounce), and you're generally better making estimates off something like 5-minute data.

Looking at statistics in clock time or trade time is an important distinction. For instance, the number of quotes or trades can vary dramatically across assets, with illiquid assets only trading a few times a day vs. liquid assets which trade many times each second. Using trade time to measure things like volatility can partly address this problem (as well as things like the significance of your estimate), although you will need to consider whether there are other clock time effects (such as open or close time seasonalities) even when you work in trade time.

For tick data, are you working with level 1 (top of the book quotes and trades) or level 2 (full order book) data? If it's level 2, then you may not only want to consider changes through time, but also across the book.

Can you summarize what RTAQ does or what those techniques are that you referred to? Like, what do the tools/techniques do for "in between the bands" analysis?
–
chrisaycockOct 5 '12 at 12:58

To Quote RTAQ manual "The Trades and Quotes data of the New York Stock Exchange is a popular input for the implementation of intraday trading strategies, the measurement of liquidity and volatility and investigation of the market microstructure, among others. This package contains a collection of R functions to carefully clean and match the trades and quotes data, calculate ex post liquidity and volatility measures and detect price jumps in the data". It will help you calculate periodicity, create aggregate bars, trade direction using Lee-Ready Algo, Covariances, Multiple Exchanges.
–
shoonyaOct 5 '12 at 13:07

I am not sure I understand step 2,3 &4. Could you illustrate with a simple example.
–
silencerOct 10 '12 at 1:21

1, 2, 3 & 4 are options, not steps. Either 1, 2, 3 or 4. ad 2) treat time as some variable tightly related to the original time series, maybe forecasting both in order to know where price goes and when it goes there. ad 3) find some small time increment such that all oservations of the original time series roughly fit on some time of your new equidistant time series. ad 4) summarize your data maybe per 500 microseconds and create for example open/high/low/close information for each 500 microsecond batch
–
KonstaOct 10 '12 at 21:06