Jekyll2020-02-27T12:10:50-08:00https://ameya005.github.io/feed.xmlAmeya JoshiI am a PhD. student at the Dept. of Electrical and Computer Engineering, New York University. I primarily work on generative deep networks and robust machine learning for deep networks.Ameya Joshiameya@iastate.eduFuture Blog Post2199-01-01T00:00:00-08:002199-01-01T00:00:00-08:00https://ameya005.github.io/posts/2012/08/future-post<p>This post will show up by default. To disable scheduling of future posts, edit <code class="language-plaintext highlighter-rouge">config.yml</code> and set <code class="language-plaintext highlighter-rouge">future: false</code>.</p>Ameya Joshiameya@iastate.eduThis post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.Deep Networks: A Rebooot2016-07-23T20:02:22-07:002016-07-23T20:02:22-07:00https://ameya005.github.io/artificial%20intelligence/image%20processing%20&amp;%20computer%20vision/deep-networks-a-rebooot<p>It’s been ages since I last posted here. But it is time to reboot this blog.</p>
<p>Since, i have always been all about computer vision, and my current job involves using the ‘deep’ and ‘wide’ now to solve medical imaging problems, here I am back to talk about deep learning, recent advances in computer vision and my trysts with it.</p>
<p>So, this being a quick update post, I will keep it short… but do await the new series of posts on me tangling with the latest and greatest of computer vision research.</p>ameya005It’s been ages since I last posted here. But it is time to reboot this blog.How ‘Deep’ runs The rabbit hole2016-07-23T19:53:30-07:002016-07-23T19:53:30-07:00https://ameya005.github.io/how-deep-runs-the-rabbit-hole<p>…And with what probability?</p>
<p>Tomas Malisiewicz talks about the balance of inference and uncertainty in a new article.</p>
<p>http://www.computervisionblog.com/2016/06/making-deep-networks-probabilistic-via.html?m=1</p>ameya005…And with what probability?Kernels Part 1: What is an RBF Kernel? Really?2015-01-28T01:33:22-08:002015-01-28T01:33:22-08:00https://ameya005.github.io/kernels-part-1-what-is-an-rbf-kernel-really<p>An intriguing article. To look at an RBF kernel as a low pass filter is something novel. It also basically shows why RBF kernels work brilliantly on high dimensional images. Given that your image features generally lie in a continuous domain, an RBF kernel generally can fit smooth solutions and thereby create more relevant separating hyperplanes,especially in case of multiple classes.</p>ameya005An intriguing article. To look at an RBF kernel as a low pass filter is something novel. It also basically shows why RBF kernels work brilliantly on high dimensional images. Given that your image features generally lie in a continuous domain, an RBF kernel generally can fit smooth solutions and thereby create more relevant separating hyperplanes,especially in case of multiple classes.An Interesting History of Computer Vision2015-01-05T17:33:01-08:002015-01-05T17:33:01-08:00https://ameya005.github.io/an-interesting-history-of-computer-vision<p>Dr. Fei Fei Li from Stanford discusses the advent and growth of computer vision in recent years. Particularly intersting is her recent research on multimodal interactions and large scale visual recognition. This has been primarily made possible due to the growth in GPU technology. I hope to try out Theano and Caffe for deep learning in this scenario soon.</p>
<p>Video:
[youtube https://www.youtube.com/watch?v=ylVsqXzlJqA&amp;w=560&amp;h=315]</p>
<p>Recent Publications from L. Fei Fei’s group:</p>
<p>http://vision.stanford.edu/publications.html</p>ameya005Dr. Fei Fei Li from Stanford discusses the advent and growth of computer vision in recent years. Particularly intersting is her recent research on multimodal interactions and large scale visual recognition. This has been primarily made possible due to the growth in GPU technology. I hope to try out Theano and Caffe for deep learning in this scenario soon.Giving eyes to a micro controller: DCMI interface on an STM32F42014-09-22T02:37:44-07:002014-09-22T02:37:44-07:00https://ameya005.github.io/image%20processing%20&amp;%20computer%20vision/giving-eyes-to-a-micro-controller-dcmi-interface-on-an-stm32f4<p>It has been another long hiatus between posts. But, I have managed to learn and do quite a bit of stuff in these last few months and it has been rewarding to say the least.</p>
<p>Recently, I have had to work on an embedded platform for image processing. It was quite a big deal as I had never worked with any sort of embedded platform before and the kind of work is quite different from what I have done before. So, my first task was to interface a camera with a microcontroller. After consulting with my friends, <a href="http://embeddlinux.blogspot.in/">Shrenik</a> and <a href="http://blog.vinu.co.in/">Vinod</a>, I decided to use a microcontroller which provides a hardware camera interface instead of writing the complete firmware from scratch for an ATMEGA as I had planned on doing earlier.</p>
<p>After some research, I ended up selecting the well known STM32F4 series of microcontrollers. The <a href="http://www.st.com/web/catalog/mmc/FM141/SC1169/SS1577/LN11">STM32F407</a> is a high powered μC with an ARM Cortex M4 processor running at 168 MHz. The development board available has 1 Mb of onchip flash and 192 kB of SRAM. After playing with GBs of RAM, it sure was tough to be excited about a few kB of SRAM, but it was a different challenge to solve the problem using as few resources as possible. The STM32F407 features a a hardware camera interface known as DCMI ( Digital Camera Interface). It is compatible with a huge range of camera modules on the market.</p>
<p>I had also decided to use the <a href="http://www.uctronics.com/ov2640-2mp-hd-cmos-camera-module-adapter-board-jpeg-out-p-1442.html">OV2640 camera module</a> as it features an on-board JPEG encoder and is quite well documented. After a few days of familiarizing with the basic concepts and fiddling around with the standard peripherals library from STM, I came across this amazing implementation, <a href="http://www.google.co.in/url?sa=t&amp;rct=j&amp;q=&amp;esrc=s&amp;source=web&amp;cd=1&amp;cad=rja&amp;uact=8&amp;ved=0CB4QFjAA&amp;url=http%3A%2F%2Fhackaday.io%2Fproject%2F1313-OpenMV&amp;ei=BuofVNDQH5WVuATc64KgAg&amp;usg=AFQjCNEe8m_KYv2zyseP90eyYpNXlxySdw&amp;sig2=8eqVshIbx1sIroXtI7aIzw&amp;bvm=bv.75775273,d.c2E">OpenMV</a>. I found it extremely helpful to understand the intricacies of image processing on embedded systems.</p>
<p>The primary issues that I faced while working on this project were:</p>
<ul>
<li>
<p>Understanding DCMI and DMA interfaces of the STM32 controller.</p>
</li>
<li>
<p>Understanding the communication and synchronization between the camera and the controller.</p>
</li>
<li>
<p>Clocks and STM’s unique proposition of allowing us to turn off peripheral clocks when required for low power usage.</p>
</li>
</ul>
<p>I am going to be blogging about my experiences regarding my foray into the world of micro-controllers and camera control soon. Until then, here are a few images that I captured with my setup.</p>
<p><a href="https://ameyajoshi005.files.wordpress.com/2014/09/test4.jpg"><img src="http://ameyajoshi005.files.wordpress.com/2014/09/test4.jpg" alt="test4" /></a><a href="https://ameyajoshi005.files.wordpress.com/2014/09/test2.jpg"><img src="http://ameyajoshi005.files.wordpress.com/2014/09/test2.jpg" alt="test2" /></a></p>ameya005It has been another long hiatus between posts. But, I have managed to learn and do quite a bit of stuff in these last few months and it has been rewarding to say the least.Back to Basics: Sparse Coding?2014-07-30T11:01:45-07:002014-07-30T11:01:45-07:00https://ameya005.github.io/back-to-basics-sparse-coding<p>A good introduction to Sparse Coding. Hope to do some stuff regarding this in the future.</p>ameya005A good introduction to Sparse Coding. Hope to do some stuff regarding this in the future.VLAD- An extension of Bag of Words2014-03-29T09:56:00-07:002014-03-29T09:56:00-07:00https://ameya005.github.io/image%20processing%20&amp;%20computer%20vision/vlad-an-extension-of-bag-of-words<p>Recently, I was a participant at TagMe- an image categorization competition conducted by Microsoft and Indian Institute of Science, Bangalore. The problem statement was to classify a set of given images into five classes: faces, shoes, flowers, buildings and vehicles. As it goes, it is not a trivial problem to solve. So, I decided to attempt my existing bag-of-words algorithm on that. It worked to an extent, I got an accuracy of 86% approximately with SIFT features and an RBF SVM for classification. In order to improve my score though, I decided to look at better methods of feature quantization. I had been looking at VLAD (Vector of Locally Aggregated Descriptors): A first order extension to BoW for my Leaf Recognition project.</p>
<p>So, I decided to attempt to use VLAD using OpenCV and implemented a small function based on the BoW API currently in OpenCV for VLAD. The results showed remarkable improvement with an accuracy of 96.5 % using SURF descriptors on teh validation dataset provided by the organizers.</p>
<h3 id="what-is-vlad"><strong>What is VLAD</strong></h3>
<p>Recalling BoW, it involved simply counting the no. of descriptors associated with each cluster in a codebook(vocabulary) and creating a histogram for each set of descriptors from an image, thus representing the information in a an image in a compact vector. VLAD is an extension of this concept. We accumulate the residual of each descriptor with respect to its assigned cluster. In simpler terms, we match a descriptor to its closest cluster, then for each cluster, we store the sum of the differences of the descriptors assigned to the cluster and the centroid of the cluster. Let us have a look at the math behind VLAD..</p>
<h3 id="mathematical-formulation"><strong>Mathematical Formulation</strong></h3>
<p>As with bag of words, we first train a codebook from the descriptors from our training dataset, as $latex C={c_1,c_2,…c_k}$ where $latex k$ is the no. of clusters in K-means. We then associate each $latex d$-dimensional local descriptor, $latex x$ from an image with its nearest neighbour in the codebook.</p>
<p>The idea behind VLAD feature quantization is that, for each cluster centroid, $latex c_i$, we accumulate the difference $latex x-c_i$ where for each $latex x$, $latex c_i = NN(x)$</p>
<p>Representing the VLAD vector for each image by $latex v$, we have,</p>
<table>
<tbody>
<tr>
<td>$latex v_{ij} =\sum_{x</td>
<td>x=NN(c_i)} {(x_j - c_{ij})}$</td>
</tr>
</tbody>
</table>
<p>where $latex i=1,2,3…k$ and $latex j=1,2,3..d$</p>
<p>The vector $latex v$ is subsequently normalized with its $latex L_2$ norm as $latex v=\frac{v}{|v|_2}$</p>
<h3 id="comparison-with-bow"><strong>Comparison with BoW</strong></h3>
<p>The primary advantage of VLAD over BoW is that we add more discriminative property in our feature vector by adding the difference of each descriptor from the mean in its voronoi cell. This first order statistic adds more information in our feature vector and hence gives us better discrimination for our classifier. This also points us to other improvements we can adding higher order statistics to our feature encoding as well as looking at soft assignment,i,e. assigning each descriptor multiple centroids weighed by their distance from the descriptor.</p>
<h3 id="experiments"><strong>Experiments</strong></h3>
<p>Here are a few of my results on the TagMe dataset.</p>
<p><a href="http://ameyajoshi005.files.wordpress.com/2014/03/results.jpg"><img src="http://ameyajoshi005.files.wordpress.com/2014/03/results.jpg?w=620" alt="results" /></a></p>
<p>###</p>
<p>###</p>
<p>###</p>
<p>###</p>
<h3 id="improvements-to-vlad">Improvements to VLAD:</h3>
<p>There are several extension possible for VLAD, primarily various normalization options. Arandjelov and Zissermann in their paper, A<a href="http://www.robots.ox.ac.uk/~vgg/publications/2013/arandjelovic13/?update=1">ll about VLAD,</a> propose several normalization techniques, including intra normalization and power normalization alonging with a spatial extension - MultiVLAD. <a href="http://delivery.acm.org/10.1145/2510000/2502171/p653-delhumeau.pdf?ip=115.248.130.148&amp;id=2502171&amp;acc=ACTIVE%20SERVICE&amp;key=045416EF4DDA69D9.B8A9898014426BF2.4D4702B0C3E38B35.4D4702B0C3E38B35&amp;CFID=309703199&amp;CFTOKEN=17981182&amp;__acm__=1396111162_07a913959d90189257582343022a03ff">Delhumeau et al</a>, propose several different normalization techniques as well as a modification to the VLAD pipeline to show improvements to almost state of the art.</p>
<p>Other references also stress on spatial pooling i.e. dividing your image into regions to get multiple VLAD vectors for each tile to better represent local features and spatial structure. A few also advise soft assignment, which refers to assignment of descriptors to multiple clusters, weighed by their distance from the cluster.</p>
<h3 id="code">Code:</h3>
<p>Here is a link to my code for TagMe. It was a quick has job for testing so it is not very clean though I am going to clean it up soon.</p>
<p><a href="https://github.com/ameya005/VLAD-Implementation"><strong>https://github.com/ameya005/VLAD-Implementation</strong></a></p>
<p>also, a few references for those who want to read the papers I referred:</p>
<p>1.<a href="http://hal.archives-ouvertes.fr/docs/00/54/86/37/PDF/jegou_compactimagerepresentation.pdf">Jégou, H., Perronnin, F., Douze, M., Sánchez, J., Pérez, P., &amp; Schmid, C. (2012). Aggregating local image descriptors into compact codes. <em>Pattern Analysis and Machine Intelligence, IEEE Transactions on</em>, <em>34</em>(9), 1704-1716.</a></p>
<ol>
<li>
<p><a href="http://dl.acm.org/citation.cfm?id=2502171">Delhumeau, J., Gosselin, P. H., Jégou, H., &amp; Pérez, P. (2013, October). Revisiting the VLAD image representation. In <em>Proceedings of the 21st ACM international conference on Multimedia</em> (pp. 653-656). ACM.</a></p>
</li>
<li>
<p><a href="http://courses.cs.washington.edu/courses/cse590v/13au/arandjelovic13.pdf">Arandjelovic, R., &amp; Zisserman, A. (2013, June). All about VLAD. In <em>Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on</em> (pp. 1578-1585). IEEE.</a></p>
</li>
</ol>
<p>Well, that concludes this post. Be on the lookout for more on image retrieval - improvements to VLAD and Fisher Vectors.</p>ameya005Recently, I was a participant at TagMe- an image categorization competition conducted by Microsoft and Indian Institute of Science, Bangalore. The problem statement was to classify a set of given images into five classes: faces, shoes, flowers, buildings and vehicles. As it goes, it is not a trivial problem to solve. So, I decided to attempt my existing bag-of-words algorithm on that. It worked to an extent, I got an accuracy of 86% approximately with SIFT features and an RBF SVM for classification. In order to improve my score though, I decided to look at better methods of feature quantization. I had been looking at VLAD (Vector of Locally Aggregated Descriptors): A first order extension to BoW for my Leaf Recognition project.SuperPixels: SLIC and more2013-12-03T23:32:41-08:002013-12-03T23:32:41-08:00https://ameya005.github.io/posts/2013/12/04/superpixels-slic-and-more<p>Hey guys,</p>
<p>I recently came across a very cool computer vision concept - SuperPixels. Superpixels are what you would call aggregations of pixels with common features - for example, color, illumination, absorbance, spatial location etc. It is perhaps the precursor to full object segmentation. Here, I am going to discuss a specific kind of superpixels called SLIC.</p>
<p>SLIC stands for Simple Linear Iterative Clustering. The name defines it all as it is just a simple representation of clustering using k-means. This was first proposed in the following paper:</p>
<p><a href="http://infoscience.epfl.ch/record/149300/files/SLIC_Superpixels_TR_2.pdf">Radhakrishna Achanta, Appu Shaji, Kevin Smith, Aure-lien Lucchi, Pascal Fua, and Sabine Susstrunk, SLIC Superpixels, EPFL Technical Report 149300, June 2010.</a></p>
<p>The basic idea of superpixels is that they contain a lot more high level information as compared to just pixels. Pixels give extremely low level and local information which many-a-times cannot be used without heavy duty processing. Instead, superpixels provide less localized information which makes more sense to have considering problems like segmentation of objects. A simple analogy would be to step back a bit and look at a slightly bigger picture.</p>
<p>To start of with, lets look at a few example images given in the paper:</p>
<p><a href="http://ameyajoshi005.files.wordpress.com/2013/09/slic1.jpg"><img src="http://ameyajoshi005.files.wordpress.com/2013/09/slic1.jpg?w=300" alt="slic1" /></a></p>
<p><a href="http://ameyajoshi005.files.wordpress.com/2013/09/slic2.jpg"><img src="http://ameyajoshi005.files.wordpress.com/2013/09/slic2.jpg?w=200" alt="slic2" /></a></p>
<p>As you can see, pixels are clustered according to their location and color. The different slices represent different sizes of superpixels. One of the major advantages of using SLIC superpixels is that they show low undersegmentation and high boundary recall. It is one of the most efficient methods in the methods for getting superpixels.</p>
<p>The algorithm is very intuitive. We start off by converting the image to L<em>a</em>b* color-space. We then create feature points from each pixel consisting of the L<em>,a</em>,b* values and the x and y co-ordinates. These features are then clustered using <a href="http://en.wikipedia.org/wiki/K-means_clustering">k-means clustering</a><a href="http://en.wikipedia.org/wiki/K-means_clustering">. </a></p>
<p>Now, one of the brilliant things of SLIC is that it takes the x-y co-ordinates into the picture while clustering. So, you get uniform and well defines superpixels. This can then be further used for segmentation. The distance function used for clustering is given by</p>
<p><a href="http://www.codecogs.com/eqnedit.php?latex=d_{lab}&space;=&space;\sqrt{&space;(l_{k}&space;-&space;l_{i})^{2}&space;&plus;&space;(a_{k}&space;-&space;a_{i})^{2}&space;&plus;(b_{k}&space;-b_{i})^{2}&space;}"><img src="http://latex.codecogs.com/png.latex?d_{lab}&space;=&space;\sqrt{&space;(l_{k}&space;-&space;l_{i})^{2}&space;&plus;&space;(a_{k}&space;-&space;a_{i})^{2}&space;&plus;(b_{k}&space;-b_{i})^{2}&space;}" alt="" /></a></p>
<p><a href="http://www.codecogs.com/eqnedit.php?latex=d_{xy}&space;=&space;\sqrt{&space;(x_{k}&space;-&space;x_{i})^{2}&space;&plus;&space;(y_{k}&space;-&space;y_{i})^{2}&space;}"><img src="http://latex.codecogs.com/png.latex?d_{xy}&space;=&space;\sqrt{&space;(x_{k}&space;-&space;x_{i})^{2}&space;&plus;&space;(y_{k}&space;-&space;y_{i})^{2}&space;}" alt="" /></a></p>
<p><a href="http://www.codecogs.com/eqnedit.php?latex=D_{S}&space;=&space;d_{lab}&space;&plus;&space;\frac{m}{S}d_{xy}"><img src="http://latex.codecogs.com/png.latex?D_{S}&space;=&space;d_{lab}&space;&plus;&space;\frac{m}{S}d_{xy}" alt="" /></a></p>
<p>So, we start off with K equally spaced clusters all across the image. These clusters are centred around the lowest gradient position in a 3x3 neighbourhood around the equally spaced points. We do this to avoid putting them at edges and to reduce the probability of choosing a noisy pixel. We calculate the image gradients as follows:</p>
<p><img src="http://www.sciweavers.org/tex2img.php?eq=G%28x%2Cy%29%20%3D%20%20%5C%7C%20I%28x%2B1%2Cy%29%20-%20I%28x-1%2Cy%29%20%5C%7C%5E2%20%2B%20%5C%7CI%28x%2Cy%2B1%29%20-%20I%28x-1%2Cy%29%5C%7C%5E2&amp;bc=White&amp;fc=Black&amp;im=jpg&amp;fs=12&amp;ff=modern&amp;edit=0" alt="G(x,y) = | I(x+1,y) - I(x-1,y) |^2 + |I(x,y+1) - I(x-1,y)|^2" /></p>
<p>where $latex I (x,y)$ is the lab vector corresponding to the pixel and $latex |.|$ is the L2 norm.
Each pixel in the image is then associated with the nearest cluster centre in the $latex labxy$ feature space. After all pixels are associated with one or the other cluster, we again calculate a new cluster centre as the average value of the $latex labxy$ features in each cluster. We then repeat the process of clustering and recalculating until we achieve convergence.</p>
<p>As a result, we will have nicely clustered groups of pixels, both in color as well as spatial domain. Now, there might be a few stray pixels that may have got clustered in the wrong clusters. Though this is pretty rare according to the authors of the paper and my experiments, we enforce a connectivity constraint on them in order to remove these outliers. The paper doesnot mention this in any great detail, but connectivity is an important condition to impose, especially if we are going to use SLIC superpixels for segmentation.</p>
<p>Thats it for the explanation of an amazing technique, folks! I will post my OpenCV code link in a few days as well as will discuss a few issues with the code and segmentation techniques using these in the next few posts.</p>
<p>P.S.:By the way, we will also be looking at Bag of Words in a little more detail in a few more days. Be on the look out for a barrage of posts.</p>ameya005Hey guys,Bag of Words - Object Recognition2013-08-01T00:41:00-07:002013-08-01T00:41:00-07:00https://ameya005.github.io/posts/2013/08/01/bag-of-words-object-recognition<p>Hey guys,</p>
<p>Its been a really long time since my last post. But this series of posts is going to be a really cool I hope.</p>
<p>Today we are going to discuss one of the most important problems in Computer Vision- <strong>Object Recognition</strong>. We humans tend to trivially recognize objects without consciously paying attention to the fact or even wondering how exactly do we achieve this. You look at a baseball flying towards your face, you recognize it as a baseball about to break your nose, and you duck! All in a matter of a few microseconds.</p>
<p>But the process that your brain undertakes in those few microseconds has eluded perfect implementation in computation for several years now. Object recognition is perhaps, rightly considered the primary problem in computer vision. But recent research advances have made strides in this matter.</p>
<p>I recently undertook a project in which I had to classify leaves into species they come from. And as it sounds, it’s not really a trivial problem. It took me a few days to figure out the first steps to such a process. And to start of with I decided to use the Bag-of Words model, a highly cited method for scene and object classification for the above problem.</p>
<p>To begin with, I found a really nice dataset to work with here: http://flavia.sourceforge.net/ . The dataset contains images for 32 species of leaves on plain white backgrounds which simplified my experiment. I am really grateful to them for providing such a comprehensive dataset for free on the web. (Kinda all for <a href="http://en.wikipedia.org/wiki/Open_access">Open Access</a> now.).</p>
<p>Bag of words is a basically a simplified representation of an image. Its actually a concept taken form Natural Language Processing where you represent documents as an unordered collection of words disregarding grammar. Translating this into CV jargon, it means that we simplify images by picking out features from an image and representing it as a collection of features. A good explanation of what features are can be found at my friend, Siddharth’s blog <a href="http://algorithmicthoughts.wordpress.com/2013/01/19/computer-vision-feature-detection/">here.</a></p>
<p>To get more technical about the BoW- we construct a vocabulary of features. We then use this vocabulary to create histograms from features for each image and then use a simple machine learning algorithm like SVM or Naive Bayes for classification.</p>
<p>This is the algorithm I followed for BoW. I got a lot of help from Roy’s blog <a href="http://www.morethantechnical.com/2011/08/25/a-simple-object-classifier-with-bag-of-words-using-opencv-2-3-w-code/">here.</a></p>
<ol>
<li>
<p>We pick out features from all the images in our training dataset. I used SIFT (Scale Invariant Feature Transform).</p>
</li>
<li>
<p>We cluster these features using any clustering algorithm. I used K-Means. (Pretty fast in OpenCV)</p>
</li>
<li>
<p>We use the cluster as a vocabulary to construct histograms. We simply count the no. of features from each image belonging to each cluster. Then we normalize the histograms by dividing it with the no. of features. Therefore, each image in the dataset is represented by one histogram.</p>
</li>
<li>
<p>Then these histograms are passed to an SVM for training. I currently use a Radial Basis function multi-class SVM in OpenCV. Using OpenCV’s CvSVM::train_auto() function, we get parameters for the SVM using cross validation.</p>
</li>
</ol>
<p>Now why does Bag of Words work? Why use it rather than simple feature matching? The answer to that question is simple: features provide just local information. Using the bag-of-words model we create a global representation of an object. Thus, we take a group of features, create a representation of the image in a simpler form and classify it.</p>
<p>That was for the pros of the algorithm. But there are a few cons associated with this model.</p>
<ol>
<li>
<p>As evident, we cannot localize an object in an image using this model. That is to say, the problem of finding where the object of interest lies is still open and needs other methods.</p>
</li>
<li>
<p>We neglect grammar. In CV terms, it means we neglect the position of features relative to each other. Thus the concept of a global shape maybe lost.</p>
</li>
</ol>
<p>As for our Leaf Recognizer, we are still working on improving the accuracy. We are almost at our goal! The following are some of the images we got as a result of the above algorithm:</p>
<p><a href="http://ameyajoshi005.files.wordpress.com/2013/08/1236.jpg"><img src="http://ameyajoshi005.files.wordpress.com/2013/08/1236.jpg?w=300" alt="Test_image_1236" /></a><a href="http://ameyajoshi005.files.wordpress.com/2013/08/1242.jpg"><img src="http://ameyajoshi005.files.wordpress.com/2013/08/1242.jpg?w=300" alt="test_image_1242" /></a></p>
<p><a href="http://ameyajoshi005.files.wordpress.com/2013/08/3176.jpg"><img src="http://ameyajoshi005.files.wordpress.com/2013/08/3176.jpg?w=300" alt="Test_image_3176" /></a><a href="http://ameyajoshi005.files.wordpress.com/2013/08/3177.jpg"><img src="http://ameyajoshi005.files.wordpress.com/2013/08/3177.jpg?w=300" alt="Test_image_3177" /></a><a href="http://ameyajoshi005.files.wordpress.com/2013/08/1373.jpg"><img src="http://ameyajoshi005.files.wordpress.com/2013/08/1373.jpg?w=300" alt="test_image_1373" /></a><a href="http://ameyajoshi005.files.wordpress.com/2013/08/1378.jpg"><img src="http://ameyajoshi005.files.wordpress.com/2013/08/1378.jpg?w=300" alt="test_image_1378" /></a></p>ameya005Hey guys,