A college at work pointed me at this interesting paper about getting meaning from unstructured data using LSI.