batch size in NARX model

4 ビュー (過去 30 日間)
tony  gian
tony gian 2018 年 8 月 3 日
回答済み: Kothuri 2025 年 6 月 4 日
If in the NARX model of matlab, the batch size is always the full size of the data and there is no minibatch method, then why should we use the shuffling command net.divideFcn = 'dividerand'; if of course our data is not sequential or in order? How does shuffling help avoid local minima and convergence in this case ?

回答 (1 件)

Kothuri
Kothuri 2025 年 6 月 4 日
The NARX‐training algorithm uses the entire dataset in each training epoch (i.e., “full‐batch” training, not mini‐batches). And the "dividerand" function Splits the data into three sets—Training, Validation, and Test at random. During each epoch, the network uses the entire “training set” to compute weight updates.
  • If your data aren’t shuffled first, a “block” split can inadvertently bias the training set.
  • A biased training set can cause the network to fit only a subset of your operating range and then fail badly on the validation or test sets.
  • By using "dividerand" function, you ensure that all regions of your input‐output space are (approximately) represented in the training portion which fosters better convergence on a truly global solution.
You can refer the below documentation link for more info on "dividerand" function:

カテゴリ

Help Center および File ExchangeDeep Learning Toolbox についてさらに検索

製品


リリース

R2018a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by