shwetashweta05 commited on
Commit
5b48db4
·
verified ·
1 Parent(s): cfd6c5d

Update pages/Life Cycle Of Machine Learning.py

Browse files
pages/Life Cycle Of Machine Learning.py CHANGED
@@ -147,3 +147,161 @@ if st.button("Exploratory Data Analysis (EDA)"):
147
  - Example: A student with Marks = 0 when others scored 70-100 could be an error.
148
  """)
149
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
147
  - Example: A student with Marks = 0 when others scored 70-100 could be an error.
148
  """)
149
 
150
+ if st.button("**Feature Engineering**"):
151
+ st.write("Feature engineering is the process of creating, modifying, or selecting features (columns) in your dataset to make machine learning models work better. Features are the input data that the model uses to learn and make predictions.")
152
+ st.write("""
153
+ Why is Feature Engineering Important?
154
+ - Improves model accuracy and performance.
155
+ - Helps the model understand the data better.
156
+ - Reduces noise and irrelevant information.
157
+ """)
158
+ st.write("""
159
+ **Steps in Feature Engineering:**
160
+ - **1.Select Relevant Features**
161
+ Keep only the columns that are important for the problem.
162
+ - Example: If you’re predicting house prices, keep features like Size, Location, and Year Built, but remove irrelevant ones like Owner's Name.
163
+ - **2.Handle Missing Values**
164
+ Fill or remove missing data to ensure clean features.
165
+ - Example: Fill missing Age values with the average age.
166
+ - **3.Create New Features**
167
+ Combine or transform existing columns to make new useful ones.
168
+ - Example: If you have Date of Birth, create a new feature called Age.
169
+ - **4.Transform Features**
170
+ Modify features to improve their scale or distribution.
171
+ - Normalize or standardize numerical features.
172
+ - Example: Convert house prices in millions to a range between 0 and 1.
173
+ - **5.Encode Categorical Data**
174
+ Convert non-numeric (categorical) data into numbers.
175
+ - One-Hot Encoding: Create new binary columns for each category.
176
+ - Label Encoding: Assign numbers to categories.
177
+ - Example: Convert Color (Red, Blue, Green) into binary columns Is_Red, Is_Blue, and Is_Green.
178
+ - **6.Feature Scaling**
179
+ Ensure all numerical features are on the same scale so one doesn’t dominate the others.
180
+ - Example: Scale features like Salary (in thousands) and Experience (in years) to a similar range.
181
+ - **7.Feature Selection**
182
+ Choose only the most important features to avoid overloading the model.
183
+ - Use methods like correlation analysis, feature importance scores, or PCA (Principal Component Analysis).
184
+ """)
185
+
186
+ if st.button("Training"):
187
+ st.write("Training a machine learning model is the process of teaching the model to make predictions by learning patterns in the data. This is done by showing the model examples (training data) and adjusting it so it performs well.")
188
+ st.write("""
189
+ **Steps in Training a Model:**
190
+ - **1.Prepare the Data**
191
+ Split your data into:
192
+ - Training Set: Used to train the model (usually 70-80% of the data).
193
+ - Testing Set: Used to check how well the model performs on unseen data.
194
+ - Example: If you have 100 rows of student data, use 80 rows for training and 20 rows for testing.
195
+ - **2.Choose a Model**
196
+ Select the algorithm or method to use for predictions. Common models include:
197
+ - Linear Regression (for predicting numbers).
198
+ - Decision Trees (for classification or regression).
199
+ - K-Nearest Neighbors (KNN) (for identifying closest patterns).
200
+ - **3.Train the Model**
201
+ Show the training data to the model so it can learn the patterns.
202
+ - During this process, the model adjusts its internal parameters to minimize errors.
203
+ - Example: A student performance prediction model might learn that Attendance and Study Hours are important for predicting grades.
204
+ - **4.Test the Model**
205
+ Check the model's performance by giving it the testing data (data it hasn't seen before).
206
+ - The model makes predictions, and you compare them to the actual values.
207
+ - Example: If the model predicts a student's grade as A and the actual grade is also A, the prediction is correct.
208
+ - **5.Evaluate the Model**
209
+ Measure how well the model is performing using metrics like:
210
+ - Accuracy: Percentage of correct predictions.
211
+ - Mean Squared Error (MSE): Average error for numerical predictions.
212
+ """)
213
+
214
+ if st.button("Testing"):
215
+ st.write("**Testing a machine learning model is the process of checking how well the model works on new, unseen data. This step helps you understand if the model can make accurate predictions or decisions when applied to real-world scenarios.**")
216
+ st.write("""
217
+ **Why Testing is Important?**
218
+ - Ensures the model doesn’t just memorize the training data but can generalize to new situations.
219
+ - Identifies if the model needs improvement.
220
+ - Measures the model's accuracy, precision, or error rate
221
+ """)
222
+ st.write("""
223
+ **Steps in Testing a Machine Learning Model:**
224
+ - **1.Prepare the Test Data**
225
+ - Use a separate dataset (called the testing set) that the model hasn’t seen during training.
226
+ - Example: If you’re predicting student grades, the test data could include students whose information was not part of the training.
227
+ - **2.Run the Model on Test Data**
228
+ - Use the model to predict outcomes based on the test data's input features.
229
+ - Example: For a grade prediction model, test data might have Study Hours and Attendance as inputs. The model predicts the grade.
230
+ - **3.Compare Predictions to Actual Outcomes**
231
+ - Check how close the predictions are to the real values.
232
+ - Example: If the model predicts Grade = B and the actual grade is also B, it’s correct.
233
+ - **4.Evaluate Performance**
234
+ - Use metrics to measure how well the model is performing:
235
+ - Accuracy: How many predictions were correct?
236
+ - Precision/Recall: Useful for classification problems.
237
+ - Mean Squared Error (MSE): Measures error in numerical predictions.
238
+ - Example: If the model predicts grades for 10 students and gets 9 right, the accuracy is 90%.
239
+ - **5.Analyze Errors**
240
+ - Understand where the model made mistakes to identify areas for improvement.
241
+ - Example: If the model struggles with students with low attendance, you might need more training data for that group.
242
+ """)
243
+
244
+ if st.button("Deployment"):
245
+ st.write("**Deployment is the process of making a trained machine learning model available for real-world use. It allows people or systems to use the model to make predictions or decisions on new data.**")
246
+ st.write("""
247
+ **Why Deployment is Important:**
248
+ - To apply the model to solve real-world problems.
249
+ - To provide predictions or insights for users, apps, or businesses.
250
+ - To continuously monitor and improve the model over time.
251
+ """)
252
+ st.write("""
253
+ **Steps in Deployment:**
254
+ - **1.Prepare the Model**
255
+ - Train and test your model until it performs well.
256
+ - Save the final version of the model.
257
+ - Example: Use Python libraries like joblib or pickle to save the trained model to a file.
258
+ - **2.Set Up a Deployment Environment**
259
+ - Decide where the model will run:
260
+ - On a Cloud Server: For large-scale use (e.g., AWS, Google Cloud, Azure).
261
+ - On a Local System: For small or private applications.
262
+ - **3.Create a User Interface (Optional)**
263
+ - Build an application that users can interact with.
264
+ - Example: Use a web app (like Streamlit or Flask) to let users input data and get predictions.
265
+ - **4.Serve the Model**
266
+ - Set up an API (Application Programming Interface) so the model can receive input and return output.
267
+ - Example: Use Flask or FastAPI to create an API endpoint that the model responds to.
268
+ - **5.Monitor Performance**
269
+ - Continuously track the model's accuracy and performance in the real world.
270
+ - Example: If the model starts making more mistakes, it may need retraining.
271
+ - **6.Update the Model**
272
+ - Retrain the model with new data as the problem or environment evolves.
273
+ - Example: A house price prediction model might need updates as market trends change.
274
+ """)
275
+
276
+ if st.button("Monitoring"):
277
+ st.write("**Monitoring a machine learning model means keeping track of how well it performs after it has been deployed. It helps you make sure the model continues to give accurate predictions when used in the real world.**")
278
+ st.write("""
279
+ **Why Monitoring is Important:**
280
+ - Ensure model accuracy: The model might perform well initially but could start making mistakes over time.
281
+ - Detect problems: For example, if the data changes, the model might need retraining.
282
+ - Keep the model updated: Regular monitoring helps decide when to update or retrain the model.
283
+ """)
284
+ st.write("""
285
+ Steps in Monitoring a Machine Learning Model:
286
+ - **1.Track Model Performance**
287
+ - Measure how well the model is doing after deployment using metrics like:
288
+ - Accuracy: How often the model is correct.
289
+ - Precision and Recall: Important for classification problems.
290
+ - Mean Squared Error (MSE): Useful for regression models.
291
+ - AUC-ROC: Used to evaluate classification models, especially for imbalanced data.
292
+ - **2.Monitor for Data Drift**
293
+ - Data drift happens when the patterns in the new data are different from the data used to train the model.
294
+ - Example: A house price prediction model trained on old data might perform poorly if the market changes.
295
+ - **3.Track Prediction Errors**
296
+ - Look for situations where the model makes large or consistent errors.
297
+ - Example: A fraud detection model might fail to catch new types of fraud.
298
+ - **4.Monitor Model Latency and Speed**
299
+ - Ensure that the model is making predictions quickly enough for real-time use.
300
+ - Example: A recommendation system in an online store needs to suggest products in a few seconds.
301
+ - **5.Check Resource Usage**
302
+ - Keep track of how much computing power and memory the model is using.
303
+ - Example: A large deep learning model might use a lot of resources and slow down a website.
304
+ - **6.Update or Retrain the Model**
305
+ - If the model’s performance drops, you may need to retrain it with new data.
306
+ - Example: If new features (like a new product category) become available, the model should be retrained.
307
+ """)