refactoring

9d606fe8 · Jonathan Poalses · df80236c · 9d606fe8
Commit 9d606fe8 authored May 26, 2023 by Jonathan Poalses

Showing with 59 additions and 59 deletions

numbers_ml.ipynb +59 -59

--- a/numbers_ml.ipynb
+++ b/numbers_ml.ipynb
@@ -24,7 +24,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 928,
+   "execution_count": 973,
   "outputs": [],
   "source": [
    "# Importing pyplot so we can visualize things\n",
@@ -78,8 +78,8 @@
   "metadata": {
    "collapsed": false,
    "ExecuteTime": {
-     "end_time": "2023-05-26T14:17:35.540587Z",
+     "end_time": "2023-05-26T14:22:29.565097Z",
-     "start_time": "2023-05-26T14:17:35.461406Z"
+     "start_time": "2023-05-26T14:22:29.468094Z"
    }
   }
  },
@@ -95,13 +95,13 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 929,
+   "execution_count": 974,
   "outputs": [
    {
     "data": {
      "text/plain": "array([0, 1, 2, ..., 8, 9, 8])"
     },
-     "execution_count": 929,
+     "execution_count": 974,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -116,15 +116,15 @@
   "metadata": {
    "collapsed": false,
    "ExecuteTime": {
-     "end_time": "2023-05-26T14:17:35.542342Z",
+     "end_time": "2023-05-26T14:22:29.588144Z",
-     "start_time": "2023-05-26T14:17:35.468601Z"
+     "start_time": "2023-05-26T14:22:29.472846Z"
    }
   }
  },
  {
   "cell_type": "markdown",
   "source": [
-    "As can be seen, the digits dataset contains images labelled with the integers 0 through 9."
+    "As can be seen, the labels of the digits dataset form a one-dimensional array of the integers 0 through 9, each giving the class of the associated image."
   ],
   "metadata": {
    "collapsed": false
@@ -132,13 +132,13 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 930,
+   "execution_count": 975,
   "outputs": [
    {
     "data": {
      "text/plain": "array([[[ 0.,  0.,  5., ...,  1.,  0.,  0.],\n        [ 0.,  0., 13., ..., 15.,  5.,  0.],\n        [ 0.,  3., 15., ..., 11.,  8.,  0.],\n        ...,\n        [ 0.,  4., 11., ..., 12.,  7.,  0.],\n        [ 0.,  2., 14., ..., 12.,  0.,  0.],\n        [ 0.,  0.,  6., ...,  0.,  0.,  0.]],\n\n       [[ 0.,  0.,  0., ...,  5.,  0.,  0.],\n        [ 0.,  0.,  0., ...,  9.,  0.,  0.],\n        [ 0.,  0.,  3., ...,  6.,  0.,  0.],\n        ...,\n        [ 0.,  0.,  1., ...,  6.,  0.,  0.],\n        [ 0.,  0.,  1., ...,  6.,  0.,  0.],\n        [ 0.,  0.,  0., ..., 10.,  0.,  0.]],\n\n       [[ 0.,  0.,  0., ..., 12.,  0.,  0.],\n        [ 0.,  0.,  3., ..., 14.,  0.,  0.],\n        [ 0.,  0.,  8., ..., 16.,  0.,  0.],\n        ...,\n        [ 0.,  9., 16., ...,  0.,  0.,  0.],\n        [ 0.,  3., 13., ..., 11.,  5.,  0.],\n        [ 0.,  0.,  0., ..., 16.,  9.,  0.]],\n\n       ...,\n\n       [[ 0.,  0.,  1., ...,  1.,  0.,  0.],\n        [ 0.,  0., 13., ...,  2.,  1.,  0.],\n        [ 0.,  0., 16., ..., 16.,  5.,  0.],\n        ...,\n        [ 0.,  0., 16., ..., 15.,  0.,  0.],\n        [ 0.,  0., 15., ..., 16.,  0.,  0.],\n        [ 0.,  0.,  2., ...,  6.,  0.,  0.]],\n\n       [[ 0.,  0.,  2., ...,  0.,  0.,  0.],\n        [ 0.,  0., 14., ..., 15.,  1.,  0.],\n        [ 0.,  4., 16., ..., 16.,  7.,  0.],\n        ...,\n        [ 0.,  0.,  0., ..., 16.,  2.,  0.],\n        [ 0.,  0.,  4., ..., 16.,  2.,  0.],\n        [ 0.,  0.,  5., ..., 12.,  0.,  0.]],\n\n       [[ 0.,  0., 10., ...,  1.,  0.,  0.],\n        [ 0.,  2., 16., ...,  1.,  0.,  0.],\n        [ 0.,  0., 15., ..., 15.,  0.,  0.],\n        ...,\n        [ 0.,  4., 16., ..., 16.,  6.,  0.],\n        [ 0.,  8., 16., ..., 16.,  8.,  0.],\n        [ 0.,  1.,  8., ..., 12.,  1.,  0.]]])"
     },
-     "execution_count": 930,
+     "execution_count": 975,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -150,8 +150,8 @@
   "metadata": {
    "collapsed": false,
    "ExecuteTime": {
-     "end_time": "2023-05-26T14:17:35.543047Z",
+     "end_time": "2023-05-26T14:22:29.588321Z",
-     "start_time": "2023-05-26T14:17:35.505976Z"
+     "start_time": "2023-05-26T14:22:29.488735Z"
    }
   }
  },
@@ -176,7 +176,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 931,
+   "execution_count": 976,
   "outputs": [
    {
     "data": {
@@ -203,15 +203,15 @@
   "metadata": {
    "collapsed": false,
    "ExecuteTime": {
-     "end_time": "2023-05-26T14:17:35.781215Z",
+     "end_time": "2023-05-26T14:22:29.706103Z",
-     "start_time": "2023-05-26T14:17:35.523679Z"
+     "start_time": "2023-05-26T14:22:29.500556Z"
    }
   }
  },
  {
   "cell_type": "markdown",
   "source": [
-    "Unfortunately, this data cannot immediately be used with a classifier, for that we would need to flatten the images, turning it from an 8x8 two dimensional array, into a 64 long one dimensional array."
+    "Unfortunately, this data cannot immediately be used with a classifier: each image is an 8x8 two-dimensional array, whereas the classifier expects a one-dimensional feature array per sample (alongside the one-dimensional array of labels). To fix that, we need to flatten each image, turning it from an 8x8 two-dimensional array into a one-dimensional array of length 64."
   ],
   "metadata": {
    "collapsed": false
@@ -219,13 +219,13 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 932,
+   "execution_count": 977,
   "outputs": [
    {
     "data": {
      "text/plain": "array([[ 0.,  0.,  5., ...,  0.,  0.,  0.],\n       [ 0.,  0.,  0., ..., 10.,  0.,  0.],\n       [ 0.,  0.,  0., ..., 16.,  9.,  0.],\n       ...,\n       [ 0.,  0.,  1., ...,  6.,  0.,  0.],\n       [ 0.,  0.,  2., ..., 12.,  0.,  0.],\n       [ 0.,  0., 10., ..., 12.,  1.,  0.]])"
     },
-     "execution_count": 932,
+     "execution_count": 977,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -238,40 +238,40 @@
   "metadata": {
    "collapsed": false,
    "ExecuteTime": {
-     "end_time": "2023-05-26T14:17:35.788274Z",
+     "end_time": "2023-05-26T14:22:29.711689Z",
-     "start_time": "2023-05-26T14:17:35.784119Z"
+     "start_time": "2023-05-26T14:22:29.708695Z"
    }
   }
  },
  {
   "cell_type": "code",
-   "execution_count": 933,
+   "execution_count": 978,
   "outputs": [
    {
     "data": {
      "text/plain": "True"
     },
-     "execution_count": 933,
+     "execution_count": 978,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
-    "# Check to see if the first dimension of the data tuples match\n",
+    "# Check whether the first entries of the two shape tuples match\n",
    "(flat_images.shape[0] == data.target.shape[0])"
   ],
   "metadata": {
    "collapsed": false,
    "ExecuteTime": {
-     "end_time": "2023-05-26T14:17:35.794289Z",
+     "end_time": "2023-05-26T14:22:29.718913Z",
-     "start_time": "2023-05-26T14:17:35.789880Z"
+     "start_time": "2023-05-26T14:22:29.712786Z"
    }
   }
  },
  {
   "cell_type": "markdown",
   "source": [
-    "As can be seen, the image data now matches the target data in size, both being a 1797 long set of arrays."
+    "As can be seen, the image data now matches the target data in its first dimension: both contain 1797 entries, one per sample."
   ],
   "metadata": {
    "collapsed": false
@@ -317,7 +317,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 934,
+   "execution_count": 979,
   "outputs": [],
   "source": [
    "# We'll start by splitting the data into training and testing, going with a 75% train, 25% test split, a 50/50 split, and a 25% train 75% test split.\n",
@@ -329,8 +329,8 @@
   "metadata": {
    "collapsed": false,
    "ExecuteTime": {
-     "end_time": "2023-05-26T14:17:35.834391Z",
+     "end_time": "2023-05-26T14:22:29.735985Z",
-     "start_time": "2023-05-26T14:17:35.801032Z"
+     "start_time": "2023-05-26T14:22:29.718702Z"
    }
   }
  },
@@ -345,7 +345,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 935,
+   "execution_count": 980,
   "outputs": [],
   "source": [
    "# First the Gaussian Bayes\n",
@@ -378,8 +378,8 @@
   "metadata": {
    "collapsed": false,
    "ExecuteTime": {
-     "end_time": "2023-05-26T14:17:38.890650Z",
+     "end_time": "2023-05-26T14:22:32.938039Z",
-     "start_time": "2023-05-26T14:17:35.810495Z"
+     "start_time": "2023-05-26T14:22:29.736114Z"
    }
   }
  },
@@ -394,7 +394,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 936,
+   "execution_count": 981,
   "outputs": [
    {
     "name": "stdout",
@@ -423,14 +423,14 @@
   "metadata": {
    "collapsed": false,
    "ExecuteTime": {
-     "end_time": "2023-05-26T14:17:38.902252Z",
+     "end_time": "2023-05-26T14:22:32.948030Z",
-     "start_time": "2023-05-26T14:17:38.891565Z"
+     "start_time": "2023-05-26T14:22:32.939526Z"
    }
   }
  },
  {
   "cell_type": "code",
-   "execution_count": 937,
+   "execution_count": 982,
   "outputs": [
    {
     "name": "stdout",
@@ -459,14 +459,14 @@
   "metadata": {
    "collapsed": false,
    "ExecuteTime": {
-     "end_time": "2023-05-26T14:17:38.996083Z",
+     "end_time": "2023-05-26T14:22:33.016620Z",
-     "start_time": "2023-05-26T14:17:38.913608Z"
+     "start_time": "2023-05-26T14:22:32.950475Z"
    }
   }
  },
  {
   "cell_type": "code",
-   "execution_count": 938,
+   "execution_count": 983,
   "outputs": [
    {
     "name": "stdout",
@@ -495,22 +495,22 @@
   "metadata": {
    "collapsed": false,
    "ExecuteTime": {
-     "end_time": "2023-05-26T14:17:39.209822Z",
+     "end_time": "2023-05-26T14:22:33.235186Z",
-     "start_time": "2023-05-26T14:17:38.996267Z"
+     "start_time": "2023-05-26T14:22:33.018553Z"
    }
   }
  },
  {
   "cell_type": "code",
-   "execution_count": 939,
+   "execution_count": 984,
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
-      "0.8377777777777777\n",
+      "0.8711111111111111\n",
-      "0.8487208008898777\n",
+      "0.8286985539488321\n",
-      "0.7789317507418397\n"
+      "0.7648367952522255\n"
     ]
    }
   ],
@@ -531,22 +531,22 @@
   "metadata": {
    "collapsed": false,
    "ExecuteTime": {
-     "end_time": "2023-05-26T14:17:39.216072Z",
+     "end_time": "2023-05-26T14:22:33.241329Z",
-     "start_time": "2023-05-26T14:17:39.212717Z"
+     "start_time": "2023-05-26T14:22:33.236558Z"
    }
   }
  },
  {
   "cell_type": "code",
-   "execution_count": 940,
+   "execution_count": 985,
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
-      "0.9844444444444445\n",
+      "0.98\n",
-      "0.9599555061179088\n",
+      "0.9655172413793104\n",
-      "0.9473293768545994\n"
+      "0.9443620178041543\n"
     ]
    }
   ],
@@ -567,14 +567,14 @@
   "metadata": {
    "collapsed": false,
    "ExecuteTime": {
-     "end_time": "2023-05-26T14:17:39.259071Z",
+     "end_time": "2023-05-26T14:22:33.286173Z",
-     "start_time": "2023-05-26T14:17:39.217697Z"
+     "start_time": "2023-05-26T14:22:33.242848Z"
    }
   }
  },
  {
   "cell_type": "code",
-   "execution_count": 941,
+   "execution_count": 986,
   "outputs": [
    {
     "name": "stdout",
@@ -603,8 +603,8 @@
   "metadata": {
    "collapsed": false,
    "ExecuteTime": {
-     "end_time": "2023-05-26T14:17:39.267734Z",
+     "end_time": "2023-05-26T14:22:33.292378Z",
-     "start_time": "2023-05-26T14:17:39.263262Z"
+     "start_time": "2023-05-26T14:22:33.287825Z"
    }
   }
  },
@@ -628,7 +628,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 942,
+   "execution_count": 987,
   "outputs": [
    {
     "name": "stdout",
@@ -750,8 +750,8 @@
   "metadata": {
    "collapsed": false,
    "ExecuteTime": {
-     "end_time": "2023-05-26T14:17:39.516773Z",
+     "end_time": "2023-05-26T14:22:33.532304Z",
-     "start_time": "2023-05-26T14:17:39.267995Z"
+     "start_time": "2023-05-26T14:22:33.292768Z"
    }
   }
  },