I believe that this is slightly backwards.
The F-statistic is a joint test for significance that is unaffected by multicollinearity. Multicollinearity will inflate the variances of the estimated coefficients, and therefore, the standard errors for the estimated coefficients, making it more likely that t-tests on individual terms will appear nonsignificant.
If the global (testing all coefficients except the intercept) F-test results in failing to reject the null hypothesis, the model is statistically useless. None of the terms tested were statistically useful for predicting the dependent variable.
In sum, multicollinearity will affect individual t-statistics through the estimated standard errors of the related coefficients. Therefore, an indicator of multicollinearity is that although no individual t-statistics for slope coefficients are significant, the F-test (joint significance) shows that at least one of the tested coefficients is different from zero.
There are other measures to identify the presence/degree of MC, but I think this answer is most relevant.
Also, to add to Dwheats: when evaluating a model (assuming a significant F-Test), it is imperative to examine other model-based statistics such as R-squared, the SER, CV, etc to determine the practical utility of the model. In a statistical sense, using R-squared by itself is not any form of “test” (no hypotheses, measure of reliability, general conventions…)
To the original question: If you failed to reject the null, it would indicate that your (tested) model is no better than using the sample mean for prediction (as you said, horizontal line). However, this just means we need to try new variables to help us predict our DV (it does not imply that the TRUE model is a horizontal line).