{"id":4,"date":"2024-02-09T03:29:23","date_gmt":"2024-02-09T03:29:23","guid":{"rendered":"https:\/\/blogs.oregonstate.edu\/mattfredericks\/?p=4"},"modified":"2024-02-09T03:29:23","modified_gmt":"2024-02-09T03:29:23","slug":"my-computer-vision-journey-thus-far","status":"publish","type":"post","link":"https:\/\/blogs.oregonstate.edu\/mattfredericks\/2024\/02\/09\/my-computer-vision-journey-thus-far\/","title":{"rendered":"My Computer Vision journey, thus far."},"content":{"rendered":"\n<p>In CS467, when I selected to work on a project that described the work as development of &#8220;an Interactive Web application,&#8221; I didn&#8217;t imagine it would involve anything to do with the field of Computer Vision. Prior to enrolling in the course, I spent sometime dusting off my javascript prowess thinking I could elect to work on a project that would have me honing those skills on the front-end of some web based app or potentially learning some commercially relevant web framework like Angular JS. What I didn&#8217;t know is that my expectations were about to be trashed. I don&#8217;t mean that in a bad way, it just wasn&#8217;t the trajectory I expected and the last thing I anticipated was to work on was training models for object detection, classification, or segmentation. <\/p>\n\n\n\n<p>From our group&#8217;s the initial meeting, I understood that the Webapp was not going to be the focus of our endeavor for the quarter. Words like &#8220;AI&#8221; were being used in conversation and acronyms like &#8216;LLM&#8217; and &#8220;neural network&#8221; were touched on, and I had a sinking feeling that I was out of my depth. Without clear direction, I reached for any viable resource I had to lean on to bring myself up to speed on what training a custom model involved. I quickly understood Python was going to be leveraged and while many CS students may feel comfortable with the language&#8217;s syntax; I had successfully navigated my course load without much need of it. I felt uneasy going through several tutorials prior to setting up my own testing environment where I trained my own custom model repeatedly. The process was slow (very slow). But from that point, I started to tinker with various options and parameters to tweak the results of my model\u2019s output. I can\u2019t get into specifics for fear of violating an NDA. But I began to see a workflow come together I and spent some time afterwards documenting my steps of procuring images, annotating those images, training the custom model, and eventually testing. Presently the training process moves a bit quicker than before, and I\u2019ve streamlined acquiring the image files I need to my local machine; thanks to some scripting tools. But the tedious process of annotation is rather time consuming, and I am starting to see why certain services like Amazon SageMaker Ground Truth in AWS have sprung up and are charging a premium for the service. Where things go from here I don\u2019t know. I assume I&#8217;ll one day be grateful for the exposure into this unique technology the project has brought me. But it&#8217;s a bit too early in the quarter to see the proverbial &#8220;light at the end&#8221; and I need to keep things moving.    \u00a0\u00a0      <\/p>\n","protected":false},"excerpt":{"rendered":"<p>In CS467, when I selected to work on a project that described the work as development of &#8220;an Interactive Web application,&#8221; I didn&#8217;t imagine it would involve anything to do with the field of Computer Vision. Prior to enrolling in the course, I spent sometime dusting off my javascript prowess thinking I could elect to [&hellip;]<\/p>\n","protected":false},"author":14073,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-4","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/blogs.oregonstate.edu\/mattfredericks\/wp-json\/wp\/v2\/posts\/4","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.oregonstate.edu\/mattfredericks\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.oregonstate.edu\/mattfredericks\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.oregonstate.edu\/mattfredericks\/wp-json\/wp\/v2\/users\/14073"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.oregonstate.edu\/mattfredericks\/wp-json\/wp\/v2\/comments?post=4"}],"version-history":[{"count":1,"href":"https:\/\/blogs.oregonstate.edu\/mattfredericks\/wp-json\/wp\/v2\/posts\/4\/revisions"}],"predecessor-version":[{"id":6,"href":"https:\/\/blogs.oregonstate.edu\/mattfredericks\/wp-json\/wp\/v2\/posts\/4\/revisions\/6"}],"wp:attachment":[{"href":"https:\/\/blogs.oregonstate.edu\/mattfredericks\/wp-json\/wp\/v2\/media?parent=4"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.oregonstate.edu\/mattfredericks\/wp-json\/wp\/v2\/categories?post=4"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.oregonstate.edu\/mattfredericks\/wp-json\/wp\/v2\/tags?post=4"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}