Mobile Health (mhealth) applications are increasing in popularity, and the collection of disease-specific time series data using Ecological Momentary Assessment (EMA) questionnaires has been shown to help in the creation of personalised predictors for next-step forecasting, which can be crucial in giving preemptive interventions. In this work, we propose a framework that aims to mitigate a common issue in EMA data - that some users contribute a bulk of the data while most users contribute too little. Our proposed framework aims to discover a ‘useful’ neighbourhood of ‘long’ users for ‘short’ ones, by optimising for the error of the user-level predictors for users with little data available for learning. For each user-level predictor, this is done by iteratively adding the next-most-similar long user from a similarity-ordered list as long as the error of the learned model does not increase. This method is compared against a baseline that exploits all available data for the long users, as well as an exhaustive search model that retains only only those users that yield the lowest error predictor. We also explore multiple ways to define similarity, and study the impact of each on the two search strategies on two EMA datasets from an mHealth app ‘TrackYourDiabetes’ - with users from Bulgaria and Spain. Our experiments over the two datasets show a 2.5% and 36.6% improvement respectively for RMSE while using on average 42.8% and 69.9% less data than the baseline method.