I meant that objective function and loss function are often used to describe the same thing 🙂 However, they are different from the error function.
Both functions are the error function (the error) + regularization (e.g., the squared or absolute value of the weights; some people call it L1/L2, some people call it Lasso/Ridge).
The reason behind this is: if we just minimize the error, we can easily get a model with very big weights, which makes our activation function's slope really steep (a little change in x causes a big change in y); our model would overfit, be unstable, and be sensitive to noise.
Minimizing both the error and the weights makes our neural network less sensitive to noise in the data, and more generalizable.
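Here's a minimal sketch of that idea in Python (assuming mean squared error as the error term; the regularized_loss name and the lam strength are just for illustration, not from any particular library):

```python
import numpy as np

def regularized_loss(y_true, y_pred, weights, lam=0.01, penalty="l2"):
    # Objective = error term + regularization term (lam is a hypothetical strength).
    error = np.mean((y_true - y_pred) ** 2)  # the error (here: MSE)
    if penalty == "l2":                      # L2 / Ridge: sum of squared weights
        reg = lam * np.sum(weights ** 2)
    else:                                    # L1 / Lasso: sum of absolute weights
        reg = lam * np.sum(np.abs(weights))
    return error + reg
```

With lam = 0 this is just the plain error; raising lam pushes the optimizer toward smaller weights.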