Main Content Detection in HTML Journal Articles