Heterogeneous data collection in the marine environment has led to large gaps in our knowledge of marine species distributions. To fill these gaps, models calibrated on existing data may be used to predict species distributions in unsampled areas, given that available data are sufficiently representative. Our objective was to evaluate the feasibility of mapping cetacean densities across the entire Mediterranean Sea using models calibrated on available survey data and various environmental covariates. We aggregated 302,481 km of line transect survey effort conducted in the Mediterranean Sea within the past 20 years by many organisations. Survey coverage was highly heterogeneous geographically and seasonally: large data gaps were present in the eastern and southern Mediterranean and in non-summer months. We mapped the extent of interpolation versus extrapolation and the proportion of data nearby in environmental space when models calibrated on existing survey data were used for prediction across the entire Mediterranean Sea. Using model predictions to map cetacean densities in the eastern and southern Mediterranean, characterised by warmer, less productive waters, and more intense eddy activity, would lead to potentially unreliable extrapolations. We stress the need for systematic surveys of cetaceans in these environmentally unique Mediterranean waters, particularly in non-summer months.

Predictive models are central to many scientific disciplines and vital for informing management in a rapidly changing world. However, limited understanding of the accuracy and precision of models transferred to novel conditions (their 'transferability') undermines confidence in their predictions. Here, 50 experts identified priority knowledge gaps which, if filled, will most improve model transfers. These are summarized into six technical and six fundamental challenges, which underlie the combined need to intensify research on the determinants of ecological predictability, including species traits and data quality, and develop best practices for transferring models. Of high importance is the identification of a widely applicable set of transferability metrics, with appropriate tools to quantify the sources and impacts of prediction uncertainty under novel conditions.

Ensemble niche modelling has become a common framework to predict changes in assemblages composition under climate change scenarios. The amount of uncertainty generated by the different components of this framework has rarely been assessed. In the marine realm forecasts have usually focused on taxa representing the top of the marine food-web, thus overlooking their basal component: the plankton. Calibrating environmental niche models at the global scale, we modelled the habitat suitability of 106 copepod species and estimated the dissimilarity between present and future zooplanktonic assemblages in the surface Mediterranean Sea. We identified the patterns (species replacement versus nestedness) driving the predicted dissimilarity, and quantified the relative contributions of different uncertainty sources: environmental niche models, greenhouse gas emission scenarios, circulation model configurations and species prevalence. Our results confirm that the choice of the niche modelling method is the greatest source of uncertainty in habitat suitability projections. Presence-only and presence-absence methods provided different visions of the niches, which subsequently lead to different future scenarios of biodiversity changes. Nestedness with decline in species richness is the pattern driving dissimilarity between present and future copepod assemblages. Our projections contrast with those reported for higher trophic levels, suggesting that different components of the pelagic food-web may respond discordantly to future climatic changes.

Coral reefs and their associated fauna are largely impacted by ongoing climate change. Unravelling species responses to past climatic variations might provide clues on the consequence of ongoing changes. Here, we tested the relationship between changes in sea surface temperature and sea levels during the Quaternary and present-day distributions of coral reef fish species. We investigated whether species-specific responses are associated with life-history traits. We collected a database of coral reef fish distribution together with life-history traits for the Indo-Pacific Ocean. We ran species distribution models (SDMs) on 3,725 tropical reef fish species using contemporary environmental factors together with a variable describing isolation from stable coral reef areas during the Quaternary. We quantified the variance explained independently by isolation from stable areas in the SDMs and related it to a set of species traits including body size and mobility. The variance purely explained by isolation from stable coral reef areas on the distribution of extant coral reef fish species largely varied across species. We observed a triangular relationship between the contribution of isolation from stable areas in the SDMs and body size. Species, whose distribution is more associated with historical changes, occurred predominantly in the Indo-Australian archipelago, where the mean size of fish assemblages is the lowest. Our results suggest that the legacy of habitat changes of the Quaternary is still detectable in the extant distribution of many fish species, especially those with small body size and the most sedentary. Because they were the least able to colonize distant habitats in the past, fish species with smaller body size might have the most pronounced lags in tracking ongoing climate change.

Habitat modelling is increasingly relevant in biodiversity and conservation studies. A typical application is to predict potential zones of specific conservation interest. With many environmental covariates, a large number of models can he investigated but multi-model inference may become impractical. Shrinkage regression overcomes this issue by dealing with the identification and accurate estimation of effect size for prediction. In a Bayesian framework we investigated the use of a shrinkage prior, the Horseshoe, for variable selection in spatial generalized linear models (GLM). As study cases, we considered 5 datasets on small pelagic fish abundance in the Gulf of Lion (Mediterranean Sea, France) and 9 environmental inputs. We compared the predictive performances of a simple kriging model, a full spatial GLM model with independent normal priors for regression coefficients, a full spatial GLM model with a Horseshoe prior for regression coefficients and 2 zero-inflated models (spatial and non-spatial) with a Horseshoe prior. Predictive performances were evaluated by cross validation on a hold-out subset of the data: models with a Horseshoe prior performed best, and the full model with independent normal priors worst. With an increasing number of inputs, extrapolation quickly became pervasive as we tried to predict from novel combinations of covariate values. By shrinking regression coefficients with a Horseshoe prior, only one model needed to be fitted to the data in order to obtain reasonable and accurate predictions, including extrapolations.