@Peter, I decided to take up what you had been trying to do. I have mailed Alex Gakuru the parsed monthly archives. Some months were missing from the archives, and all I had went up to March 2009, so there are about four archive files which still need to be parsed.

To do this I developed a tool, which has done the job very well. I am sending Alex the source code for the tool, which can be compiled on any OS that has a C++ compiler.

All I developed was the tool, which I used to parse all the files. I have also given the schema for the data file and the query to load it into a MySQL database. As for code to make the data searchable, I don't think I can squeeze in time for that, so maybe you or someone from this list can take it up and develop one for all our sakes.
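
A minimal sketch of what the load-and-search step could look like, assuming the parsed data is a list of messages with sender, date, subject and body fields. The table name, column names and sample records are hypothetical and are not the schema mentioned above; Python's built-in sqlite3 module stands in for MySQL only so that the example runs without a server.

```python
import sqlite3

# Hypothetical records, shaped the way a parser might emit them.
records = [
    ("alice at example.com", "2009-04-06", "[Skunkworks] new server",
     "We have moved to a new box."),
    ("bob at example.com", "2009-04-07", "[Skunkworks] archives",
     "Can we make the archives searchable?"),
]

# In-memory database; a real setup would connect to MySQL instead.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE messages ("
    " id INTEGER PRIMARY KEY,"
    " sender TEXT, sent_on TEXT, subject TEXT, body TEXT)"
)
conn.executemany(
    "INSERT INTO messages (sender, sent_on, subject, body) VALUES (?, ?, ?, ?)",
    records,
)
conn.commit()

def search(term):
    """Return (sent_on, subject) for messages whose subject or body contains term."""
    like = f"%{term}%"
    rows = conn.execute(
        "SELECT sent_on, subject FROM messages"
        " WHERE subject LIKE ? OR body LIKE ? ORDER BY sent_on",
        (like, like),
    )
    return rows.fetchall()

print(search("archives"))
```

A plain LIKE query is the simplest thing that works for a few megabytes of mail. On MySQL, a FULLTEXT index on the subject and body columns would be worth considering if searches become slow.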

Steve Obbayi

From: skunkworks-bounces@lists.my.co.ke [mailto:skunkworks-bounces@lists.my.co.ke] On Behalf Of Peter Karunyu
Sent: Tuesday, April 07, 2009 6:08 PM
To: Skunkworks forum
Subject: Re: [Skunkworks] Skunkworks List and available resources

Hi Alex,
I think it's time I did what I've been waiting for someone to do: load all the past conversations into a searchable database. I had started with the text file for March. I went through it looking for unique sequences which I could then use as "markers" separating one thread from another, and then use the PHP explode function to load the individual threads into a MySQL table.
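
A minimal sketch of that marker-based split, written in Python rather than PHP (a regular-expression split plays the role of explode here). The marker is an assumption: it supposes every message in the archive text starts with an mbox-style separator line beginning with "From ", which may not match the real files, and it splits per message rather than per thread. The sample text and the name split_messages are made up for illustration.

```python
import re

# Assumed marker: an mbox-style separator line such as
# "From someone at example.com  Mon Apr  6 18:33:00 2009".
# This is a guess at the archive layout, not taken from the real files.
MARKER = re.compile(r"^From \S+ at \S+ +.*\d{4}$", re.MULTILINE)

def split_messages(archive_text):
    """Split archive text into one string per message, marker lines excluded."""
    parts = MARKER.split(archive_text)
    # Drop the empty piece before the first marker and trim whitespace.
    return [p.strip() for p in parts if p.strip()]

sample = (
    "From alice at example.com  Mon Apr  6 18:33:00 2009\n"
    "Subject: [Skunkworks] first\n\nHello list\n"
    "From bob at example.com  Mon Apr  6 21:02:00 2009\n"
    "Subject: [Skunkworks] second\n\nHello again\n"
)

messages = split_messages(sample)
print(len(messages))
```

Each resulting string could then be inserted as one row of a database table. Grouping messages into threads would need an extra step, for example matching on the Subject line.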

My main motivation was to be able to search past entries.

Has someone done something similar?

On Mon, Apr 6, 2009 at 9:02 PM, Gakuru Alex <alexgakuru.lists@gmail.com> wrote:
Did we also lose all our past talk:-( I notice
http://lists.my.co.ke/pipermail/skunkworks/ only has the current
month's archives. On 25th last month, I downloaded the following from
the previous server archives:-

2008-March.txt.gz
2008-April.txt.gz
2008-June.txt.gz
2008-July.txt.gz
2008-August.txt.gz
2008-September.txt.gz
2008-October.txt.gz
2008-November.txt.gz
2008-December.txt.gz

2009-January.txt.gz
2009-February.txt.gz
2009-March.txt.gz

5 MB in total. If needed for institutional memory/restore and KeNIC
does not have the full backups, just let me know and I will send them.

regards,

Alex


On Mon, Apr 6, 2009 at 6:33 PM, Lmwangi <lmwangi@gmail.com> wrote:
> Hi all,
>  * As you may have noticed, we are on a new server. Migrating to the
> new box resulted in a loss of your  settings (Digests/Plain text mails
> etc.).
>   Please visit
> http://lists.my.co.ke/cgi-bin/mailman/listinfo/skunkworks to adjust
> your personal settings
>  * We also do have an announce list for products/meeting
> announcements. You may want to subscribe through
> http://lists.my.co.ke/cgi-bin/mailman/listinfo/skunkworks-announce.
>  * my.co.ke hosts a bulletin board that you can visit at
> http://my.co.ke/phpbb/index.php . We intend to use it to complement
> the mailing list.
>  * To find out about all the services available, my.co.ke index page
> has a simple listing
>  * If you have any questions/request please reply to this thread.
>
> Regards,
> Laban
> _______________________________________________
> Skunkworks mailing list
> Skunkworks@lists.my.co.ke
> http://lists.my.co.ke/cgi-bin/mailman/listinfo/skunkworks
> Other services @ http://my.co.ke
>