I will be releasing the first tranche of data in the 775,000-word lexical database in mid-August. It will most probably be available as a series of downloads at pamanyungan.net. To download the data, you will need to register and agree to some terms and conditions. More about that once the data are released.
In the meantime, I will be doing a series of posts about features of the dataset and some of its uses. I hope this will encourage others to contribute data, or to allow us to make data readily available.