In recent years, Japanese text-to-speech (TTS) synthesis has been actively researched. Generating high-quality synthetic speech requires estimating appropriate prosodic information. However, manual annotation is costly, and automatic annotation introduces estimation errors. To overcome these problems, this paper examines integrating accent sandhi and prosodic feature estimation into the acoustic model for Japanese TTS. The proposed method optimizes the F0 model as a whole by using linguistic features from a dictionary. Objective and subjective evaluations confirmed that the cost of creating accent labels was reduced and that the accuracy of prosodic feature estimation was improved.