On Mon, Sep 10, 2007 at 11:18:19AM -0700, Steven Kurylo wrote:
> I have a maildir which currrently has (and growing) 187851 messages in
> it as reported by ls |wc -l
>
> Right now I have about a 180 message ids I need to find in there, but
> I do know most of them are recent:
>
> for i in `cat ~/message.ids.clean`;do echo START;echo $i; find -ctime
> -90 -exec grep -l $i {} \;;echo END;done > ~/results
>
> This is a very slow way to go about this. Even if I did parallelize
> it, hoping the disk cache would make it faster, that still wouldn't
> help much. Can someone think of an obviously faster bash script?
>
> Are there any tools to index the files for me? Or maybe I could parse
> out the interesting bits (message id, date, recipients, etc) and put
> them in a database.
I index my mail with a tool called mairix
<http://www.rpcurnow.force9.co.uk/mairix/> to index my mail folders
(currently 240174 Maildir files). Once indexed, searches are pretty
quick and it hasn't gobbled my email yet.
-- Jason
Received on Mon Sep 10 14:13:36 2007
This archive was generated by hypermail 2.1.8 : Mon Sep 10 2007 - 14:13:39 CST