I find this interesting. If this isn't the place to pursue it then I'd
be interested in subscribing to that mailing list. :-D
cheers,
Bruce
On Wed, Sep 5, 2012 at 5:11 PM, George Kousiouris <gkousiou@mail.ntua.gr> wrote:
>
> Hi all,
>
> As part of the research for an ongoing project, we are interested in
> investigating the ability to predict data access patterns on a hadoop
> cluster. The purpose is to study the file access patterns (in a time series
> manner), so that proactive manipulation of data may be achieved. This for
> example may involve the increase/decrease of the replication factor in an
> Apache Hadoop cluster (and according HDFS) to deal with an upcoming
> predicted increase/decrease of data accesses.
>
> So we would like your advise on some issues:
> 1) is this the correct mailing list? :)
> 2) would a changed replication factor translate to a better performance of a
> MR job (either by experience you may have or if you have in mind a
> report/paper etc. that has studied this)
> 3) do you find this interesting in general and something we should pursue?
> 4) are you aware of any related work on the topic we could use as a starting
> point?
>
> Thanks for your help,
> George
>
--
@otfrom | CTO & co-founder @MastodonC | mastodonc.com