> what I was thinking. I did add some comments with some ideas on how to
> implement it, but basically all you would really need is the inode ID
> and last left of location and you can pick up where you left off. So
> when you first start up you can compare the inode ID to file name your
> expecting. If they don't match finish sending off that file then go
> get the actual new files your suppose to be following.
>
> Now the only down fall would be what happens if log rotate rolls the
> log twice while Flume is down. You wouldn't be able to get that log
> file in the middle. In those cases you would need to manually
> intervene but at least its a whole lot easier.
>
>
>
> On Fri, Jan 14, 2011 at 12:08 PM, Jonathan Hsieh <
j...@cloudera.com> wrote:
> > Nick,
> > This isn't available out-of-the-box currently and my first thought is that
> > this is a new feature.
> > My worry with this is what happens if the agent is down a for a while and
> > the file it was following has been moved or renamed. One approach may be to
> > save off some checksum/offset data periodically. To recover we could
> > re-read but not resend if the checksum matches.
> > Alternately there was another proposal that roughly boils down to some kind
> > of ability to change sink without changing (and thus losing the state) of a
> > source.
https://issues.cloudera.org/browse/FLUME-463
> > Jon.
>