From b6471ecbc1835d8ae420edcb4e8a43fac7111a52 Mon Sep 17 00:00:00 2001 From: narae970210 <60998194+narae970210@users.noreply.github.com> Date: Sun, 31 May 2020 23:19:35 -0400 Subject: [PATCH 1/2] Added proposal.md, blog.md --- .DS_Store | Bin 0 -> 8196 bytes Intro_to_prob/blog.md | 57 ++++++++++++++++++++++++++++++++++++++ Intro_to_prob/proposal.md | 32 +++++++++++++++++++++ 3 files changed, 89 insertions(+) create mode 100644 .DS_Store create mode 100644 Intro_to_prob/blog.md create mode 100644 Intro_to_prob/proposal.md diff --git a/.DS_Store b/.DS_Store new file mode 100644 index 0000000000000000000000000000000000000000..9ce3de60e384a3d528ec9b3a127fe8583ebb0ddb GIT binary patch literal 8196 zcmeHM%Wl&^6ur|IPM~TRRL#;GBoPif#~6W}=xh9^W%F_hiP-oe&YL4%{uG5)m2599OTS zn3DK?p3kK@)v^X;;1f9%P(&@-r0zLu-JliF3TOqi0$KsB!2h5C-q}1E7rgi7U0rGg zv;s4!0DnH1$Q&yQClbY@1BD0yfD33A33bE)f@3PID4a+XRP-sM2d1b@g&0iIao^30&L`f$m>BQ8?OhqV6j1HbD%!ySb>QXD96*#W|kKJ!jm3)e^DcL>ARhp3ppGi zuFykk7r9RjoHlGyhumCG?{hsZQ5#&&^C&k{_A_TYPlxn?4zfK8&p)I*H>4HSX`hZz z&-F^&|Fr!|VsByAF|rNseBoJ>yzFQ~i8OmLVISu#xuahBsl z!zMHhFt2c$qTGLGr2JpTIQ*O8n%055l;4b#8|?>}D}TpyhW)NuZC{;k;P>Nl`5T*? zUsznaW*9}IxLkZ;x1*sQH{wCR+UPywS4WMw?Wp?WC<>B#!+GL+!}{vYgD8v}e(3cC z*>^pdJU{V6HyT!>LFo2HKeoJK7zLwHUoDMBJA2z3=61O}-Y`eIyX6gYXZ!AWTrk#e zZQVaS?Yw?7dOLpi5n>`S^;V)4d7YG>P`gB|ptWT;qfR%mztp-DS(XyW_ZCM3tPB0m4OYDVWfoDRjt5mDKIbHE%ErjcJcZD zY>A@1)e6j*0&Kyutty=U^D8FrKaaIdRjPDSBF UB3h9C`9Xkw{|nu9G206K20Hdu#sB~S literal 0 HcmV?d00001 diff --git a/Intro_to_prob/blog.md b/Intro_to_prob/blog.md new file mode 100644 index 0000000..83337ec --- /dev/null +++ b/Intro_to_prob/blog.md @@ -0,0 +1,57 @@ +# Probability and Statistics + +## What is a probability? Why do we learn probability and statistics in Machine Learning? + +You are probably wondering why probability is a material in Computer Science and Machine Learning. But you are on the right page and here you are about to dive into the fundamental concepts of Machine Learning. Probability is the basis of lots of ML algorithms, such as Naive Bayes Algorithm and Expectation-Maximization algorithm, and much more. + +How much do we need to know to implement these algorithms? We can start off with two basic concepts, Conditional Probability and Joint Probability and we can expand our knowledge as we go! + + +## Probability + +Probability is an extent to which an event is likely to occur. A probability as a number lies between 0 and 1; 0 means that an event will NEVER happen, and 1 means that an event WILL happen. A probability of 0.3 means an event would happen with a 30% chance and will not happen with a 70% chance. + +Often, we use a notation of P(A), which refers to the probability that event A will occur. + +Pop Quiz: +Suppose the probability that event B will occur is 0.25. Write this expression in a mathematical notation. + + + +## Conditional Probability + +Conditional probability is a measure of the probability of an event occurring given that another event has occurred. It’s denoted in a form of P(A|B), which refers to the probability that event A occurs given that event B has occurred. + +Suppose that the probability that Hannah would bring an umbrella when it rains is 0.85. Let U represent the event that Hannah brings an umbrella and let R represent the event that it rains outside. This statement can be expressed as P(U|R) = 0.85 + +Pop Quiz: +Suppose the probability that event B occurs given that event C has occurred is 0.7. Write this expression in a mathematical notation. + + + +## Joint Probability + +P(A and B) refers to the probability that event A and event B both occur. In a mathematical notation, we often use P(A ∩ B). +One thing to note is that P(A and B) = P(B and A) and P(A ∩ B) = P(B ∩ A). + +But, P(A and B) will be different depending on the dependence of the two events. + +If the outcome of an event is not dependent on the outcome of the other event, they are independent. +  P(A and B) = P(A) * P(B) + +If the outcome of an event affects the outcome of the other event as in the example of Hannah bringing an umbrella when it rains, they are dependent. +  P(A and B) = P(A|B) * P(B) + +Pop Quiz: +Given the following probabilities, calculate the probability that both events R and U (It rains outside AND Hannah brings an umbrella) occur. +R: It rains outside. P(R) = 0.3 +U: Hannah brings an umbrella. P(U) = 0.1 +P(U|R) = 0.85 + + +## Summary +In this chapter, you learned the basic concepts of probability, dependence and independence of events, conditional probability, and joint probability. Now you are ready to learn Bayes’ theorem, which uses all of these probabilities you just learned! + +To learn about Bayes’ theorem, here is a helpful resource! +https://www.youtube.com/watch?v=XQoLVl31ZfQ + diff --git a/Intro_to_prob/proposal.md b/Intro_to_prob/proposal.md new file mode 100644 index 0000000..4c98fdd --- /dev/null +++ b/Intro_to_prob/proposal.md @@ -0,0 +1,32 @@ +# TEMPLATE + +## :fire: Do not edit this file - copy the template and create your own file. + +**[Step-By-Step Technical Blog Guide](https://hq.bitproject.org/how-to-write-a-technical-blog/)** + +### :pushpin: Step 1 +**TITLE:** +Why should we know Probability in Machine Learning? + +**TOPIC:** +Probability and Statistics + +**DESCRIPTION (5-7+ sentences):** +This technical blog covers probability, joint probability, and conditional probability. It also covers dependence / independence of events and how the dependency affects the calculation of joint probability. These are the basic concepts for Bayes’ rule and Bayes’ rule is widely used in Artificial Intelligence and Machine Learning. This blog provides definition, notation, example, and a pop-quiz for the audience to try. + + +### :pushpin: Step 2 +:family: **TARGET AUDIENCE (3-5+ sentences):** +Any students who are taking ML class and get stuck on the probability part. At least at my school, prob and stats class is not a prerequisite for an AI class and I have seen lots of friends struggling in the probability part. Target audience could be anyone who wants to start learning ML and does not have a probability and statistics background. + +### :pushpin: Step 3 +> Outline your learning/teaching structure: + +**Beginning (2-3+ sentences):** +start with why we should learn probability in ML and list out a few ML algorithms that use probability. + +**Middle (2-3+ sentences):** +Explain different types of probabilities in detail. Has simple and easy examples for students to follow and a pop-up quiz section. + +**End (2-3+ sentences):** +Summarize what we have learned and provide next steps. For the next steps, I provide a 5-min Youtube link that covers Bayes’ theorem. From 1a456c86849acaea95521ff3fdfaac88947f2739 Mon Sep 17 00:00:00 2001 From: narae970210 <60998194+narae970210@users.noreply.github.com> Date: Mon, 1 Jun 2020 17:22:05 -0400 Subject: [PATCH 2/2] Proposal for Data Visualization --- .DS_Store | Bin 8196 -> 8196 bytes Big Data - Data Viz/proposal.md | 29 +++++++++++++++++++++++++++++ IMC_DataScience/.DS_Store | Bin 0 -> 6148 bytes 3 files changed, 29 insertions(+) create mode 100644 Big Data - Data Viz/proposal.md create mode 100644 IMC_DataScience/.DS_Store diff --git a/.DS_Store b/.DS_Store index 9ce3de60e384a3d528ec9b3a127fe8583ebb0ddb..e83865712559b47dd9639702693e12da5ffbc162 100644 GIT binary patch delta 510 zcmZp1XmOa}FDk>pz`)4BAi%)j!jQ;N0>laox*H3pu}^H^-OSFx!ch-WA!kjan^ zltWU5!V3e+S9#{-Cnx3PCjoT;4P>wZ8lChX41heMP02|&3{K9^Edc9bs3%2hZoZ2P z)Bzk1c25hq;&jN7c#CuMQ!sT~67BAS3}kmRFihqV&}MX;Y$2e@xPEeyfH9-&%pxg0etu zrv#0G+WrX|vjY9Z$T7J~&ydcb$561ba2or>2Hwr=94s7+qLZBjwHd7^ z=Lu>uuADqW(3nwt@)>{#2 zm8v4fK$Vd|(t2{Qh$ZU>pr!niCyJ&rER-hCIMEk} Outline your learning/teaching structure: + +**Beginning (2-3+ sentences):** +Explanation of what we are going to cover in the blog. I will explain the dataset and two different sheets I am going to create. + +**Middle (2-3+ sentences):** +The first sheet will have a table with state, county, # cases, # deaths columns. In this tab, I will apply a county filter where users can select all or specific counties and I will use NY data only for simplicity. + +The second sheet will have a bar graph or line graph indicating the number of confirmed cases in the states with most coronavirus cases, such as NY, NJ, IL, and CA. In this sheet, the audience will learn how to create a graph and apply a date range filter. + +**End (2-3+ sentences):** +Summarize the technical skills that we cover in the blog and provide additional resources if needed. \ No newline at end of file diff --git a/IMC_DataScience/.DS_Store b/IMC_DataScience/.DS_Store new file mode 100644 index 0000000000000000000000000000000000000000..3024f00f2fb062f5147cd78a2f3800ee223b57ed GIT binary patch literal 6148 zcmeHKK}#z!5T4YkX&+(_f*x}b-l2z;iV7kuZ9Vj$2rIsWKI*P5y0C7f-CDF(_HXDv z@)!7XbS6pVJ*s$n%3}s5-|S>2yZI859RLvZL1zu10ss<~uu#J09ie{GDJeM545Bb& zG$DW(QrHJQ70r(C$N;U~BD7!+8qkJ6*DuAjJcM1?YbE+DN_2#gv`w~Af4&$&>(lIL z*uQewPtjEp_cPCX7KNGO?A*MRWm#S*{|+wnFv!Ae(65KRV|qFWvx~^thgv6Zb{HMS z-C?`3)X-@b#%Z@_lH*PnLr#9hX-5z1dXRSdW}LuwNLiAlc4cKWTHmTwRn7CpRW;h& z^r~vTwlN-;}yWk&{s-*^JBp zGr$adIRoxsl0UxuOZWqr0cPO4F+k^oLM8Ma<`&J-fsH8v5R2$Gf;RmmQjT)yJIpO& z4~np