{"id":39335,"date":"2020-06-17T05:05:50","date_gmt":"2020-06-17T02:05:50","guid":{"rendered":"https:\/\/tentaclii.wordpress.com\/?p=39335"},"modified":"2020-06-17T05:05:50","modified_gmt":"2020-06-17T02:05:50","slug":"nlp-with-lovecraft","status":"publish","type":"post","link":"https:\/\/jurn.link\/tentaclii\/index.php\/2020\/06\/17\/nlp-with-lovecraft\/","title":{"rendered":"NLP with Lovecraft"},"content":{"rendered":"<p>Lovecraft with NLP. No, not the dodgy cultic &#8216;neuro-linguistic programming&#8217;. NLP as in proper hardcore computer programming, in the form of &#8216;Natural Language Processing&#8217; for digital humanities work. <em>Towards Data Science<\/em> currently has long articles showing exactly how to have a computer crunch the Lovecraft fiction corpus and thus help to answer questions such as&#8230;<\/p>\n<blockquote><p>Are the stories as negative as we thought? What are the most used adjectives, are they \u201chorrible\u201d and \u201cunknown\u201d and \u201cancient\u201d?<\/p><\/blockquote>\n<p>Ideally the corpus would first be carefully chunked, split into distinct sections relating to Lovecraft&#8217;s phases and places, with each section probed separately. The corpus is probably big enough to chunk. Without chunking, you&#8217;d get a bit of a smushy answer to such questions. 
&#8220;The Quest of Iranon&#8221; (1921) is not the same beastie as &#8220;The Shadow out of Time&#8221; (1935) etc.<\/p>\n<p><a href=\"https:\/\/towardsdatascience.com\/lovecraft-with-natural-language-processing-part-1-rule-based-sentiment-analysis-5727e774e524\">Lovecraft with NLP: Part 1: Rule-Based Sentiment Analysis<\/a><\/p>\n<p><a href=\"https:\/\/towardsdatascience.com\/lovecraft-with-natural-language-processing-part-2-tokenisation-and-word-counts-f970f6ff5690\">Lovecraft with NLP: Part 2: Tokenisation and Word Counts<\/a><\/p>\n<p>It looks like more parts are planned.<\/p>\n<p><em>Update: <a href=\"https:\/\/towardsdatascience.com\/lovecraft-with-natural-language-processing-part-3-tf-idf-vectors-8c2d4df98621\">Lovecraft with NLP: Part 3: TF-IDF and K-Means Clustering<\/a>. At which point, having seen two articles, you hit the paywall.<\/em><\/p>\n<p><em>Update: <a href=\"https:\/\/towardsdatascience.com\/lovecraft-with-natural-language-processing-part-4-latent-semantic-analysis-70aa2fa2161b\">Lovecraft with NLP: Part 4: Latent Semantic Analysis<\/a>.<\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Lovecraft with NLP. No, not the dodgy cultic &#8216;neuro linguistic programming&#8217;. 
NLP as in proper hardcore computer programming, in the &hellip;<\/p>\n<p><a href=\"https:\/\/jurn.link\/tentaclii\/index.php\/2020\/06\/17\/nlp-with-lovecraft\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[24],"tags":[],"class_list":["post-39335","post","type-post","status-publish","format-standard","hentry","category-scholarly-works"],"_links":{"self":[{"href":"https:\/\/jurn.link\/tentaclii\/index.php\/wp-json\/wp\/v2\/posts\/39335","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jurn.link\/tentaclii\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jurn.link\/tentaclii\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jurn.link\/tentaclii\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jurn.link\/tentaclii\/index.php\/wp-json\/wp\/v2\/comments?post=39335"}],"version-history":[{"count":0,"href":"https:\/\/jurn.link\/tentaclii\/index.php\/wp-json\/wp\/v2\/posts\/39335\/revisions"}],"wp:attachment":[{"href":"https:\/\/jurn.link\/tentaclii\/index.php\/wp-json\/wp\/v2\/media?parent=39335"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jurn.link\/tentaclii\/index.php\/wp-json\/wp\/v2\/categories?post=39335"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jurn.link\/tentaclii\/index.php\/wp-json\/wp\/v2\/tags?post=39335"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}