NAVIGATING HETEROGENEITY TO LEARN FROM LARGE-SCALE CANCER DATA: OPTIMIZATION, REDUNDANCY, AND GENERALIZATION