{"id":770,"date":"2021-08-04T13:34:46","date_gmt":"2021-08-04T20:34:46","guid":{"rendered":"https:\/\/wou.edu\/linguistic-landscape-corpus\/?page_id=770"},"modified":"2024-10-17T15:50:04","modified_gmt":"2024-10-17T22:50:04","slug":"faq","status":"publish","type":"page","link":"https:\/\/wou.edu\/linguistic-landscape\/faq\/","title":{"rendered":"LL Corpus FAQ"},"content":{"rendered":"\n[et_pb_section fb_built=&#8221;1&#8243; admin_label=&#8221;section&#8221; _builder_version=&#8221;4.16&#8243; custom_padding=&#8221;0px|||||&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_row admin_label=&#8221;row&#8221; _builder_version=&#8221;4.16&#8243; background_size=&#8221;initial&#8221; background_position=&#8221;top_left&#8221; background_repeat=&#8221;repeat&#8221; custom_padding=&#8221;0px|||||&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.16&#8243; custom_padding=&#8221;|||&#8221; global_colors_info=&#8221;{}&#8221; custom_padding__hover=&#8221;|||&#8221;][et_pb_text _builder_version=&#8221;4.19.3&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;]<h2>LL Corpus FAQ<\/h2>\n[\/et_pb_text][et_pb_toggle title=&#8221;What is the LL Corpus?&#8221; use_icon_font_size=&#8221;on&#8221; icon_font_size=&#8221;30px&#8221; open_use_icon_font_size=&#8221;on&#8221; open_icon_font_size=&#8221;30px&#8221; module_class=&#8221;aks-faq&#8221; _builder_version=&#8221;4.21.0&#8243; _module_preset=&#8221;default&#8221; title_level=&#8221;h3&#8243; custom_margin=&#8221;0px||0px||false|false&#8221; animation_style=&#8221;slide&#8221; animation_direction=&#8221;left&#8221; animation_duration=&#8221;750ms&#8221; animation_intensity_slide=&#8221;5%&#8221; hover_enabled=&#8221;0&#8243; custom_css_main_element=&#8221;background-color: transparent;&#8221; border_width_all=&#8221;0px&#8221; border_width_bottom=&#8221;2px&#8221; global_colors_info=&#8221;{}&#8221; sticky_enabled=&#8221;0&#8243;]<ul>\n<li><span class=\"tadv-color\">The LL Corpus is the full text of 383 published journal articles (1997-2017) and 165 book chapters (2008-2018) for a total of 548 items. The LL Corpus is a freely available resource intended for use by Linguistic Landscape scholars and students as well as corpus linguists. The CQPweb search interface enables users to perform anything from simple word searches to advanced corpus analysis. The LL Corpus does not infringe on copyright restrictions because the results of searches only display 200-words of text from any item. Links in the metadata for each item will direct users to the DOI or URL for the publication so that users can access the full text via institutional or individual methods.<\/span><\/li>\n<\/ul>\n<ul>\n<li><span class=\"tadv-color\"><strong>The LL Corpus is not<\/strong><span>\u00a0<\/span>a substitute for the original articles and chapters that it contains.<\/span>\n<ul>\n<li><span class=\"tadv-color\"><strong>Tables, charts, figures, numerals and non-English text will be formatted differently, removed, or reproduced inaccurately<\/strong><span>\u00a0<\/span>due to the conversion process from published text to corpus-searchable plain text; however, every reasonable effort was taken to make the English text as accurate as possible.<\/span><\/li>\n<li><span class=\"tadv-color\">Many of the works contained in the LL Corpus are under<span>\u00a0<\/span><strong>copyright<span>\u00a0<\/span><\/strong>and\/or not freely available. For this reason, users can only see 100 words to the left and 100 words to the right of a search term\/phrase of each text\u2014the full texts are not available. However, the links provided in the metadata should take you stable webpages where you can see either the full text or information on how to purchase or find institutional access to items.<\/span><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n[\/et_pb_toggle][et_pb_toggle title=&#8221;How do I cite the LL Corpus? (APA)&#8221; use_icon_font_size=&#8221;on&#8221; icon_font_size=&#8221;30px&#8221; open_use_icon_font_size=&#8221;on&#8221; open_icon_font_size=&#8221;30px&#8221; admin_label=&#8221;Why doesn\u2019t the ECS degree come with a teaching license?&#8221; module_class=&#8221;aks-faq&#8221; _builder_version=&#8221;4.21.0&#8243; title_level=&#8221;h3&#8243; custom_margin=&#8221;0px||0px||false|false&#8221; animation_style=&#8221;slide&#8221; animation_direction=&#8221;left&#8221; animation_duration=&#8221;750ms&#8221; animation_intensity_slide=&#8221;5%&#8221; hover_enabled=&#8221;0&#8243; custom_css_main_element=&#8221;background-color: transparent;&#8221; border_width_all=&#8221;0px&#8221; border_width_bottom=&#8221;2px&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221; custom_css_toggle_title_last_edited=&#8221;on|desktop&#8221; title_text_color__hover_enabled=&#8221;off|hover&#8221; title_text_color__hover=&#8221;#000000&#8243; sticky_enabled=&#8221;0&#8243;]<ul>\n<li><span class=\"tadv-color\">Troyer, Robert A. (2021). Linguistic Landscape Corpus. CQPweb at Lancaster. https:\/\/cqpweb.lancs.ac.uk\/llscape202107\/<\/span><span class=\"tadv-color\"><\/span><\/li>\n<\/ul>\n[\/et_pb_toggle][et_pb_toggle title=&#8221;What details about the LL Corpus should I be aware of?&#8221; use_icon_font_size=&#8221;on&#8221; icon_font_size=&#8221;30px&#8221; open_use_icon_font_size=&#8221;on&#8221; open_icon_font_size=&#8221;30px&#8221; admin_label=&#8221;Why doesn\u2019t the ECS degree come with a teaching license?&#8221; module_class=&#8221;aks-faq&#8221; _builder_version=&#8221;4.21.0&#8243; title_level=&#8221;h3&#8243; custom_margin=&#8221;0px||0px||false|false&#8221; animation_style=&#8221;slide&#8221; animation_direction=&#8221;left&#8221; animation_duration=&#8221;750ms&#8221; animation_intensity_slide=&#8221;5%&#8221; hover_enabled=&#8221;0&#8243; custom_css_main_element=&#8221;background-color: transparent;&#8221; border_width_all=&#8221;0px&#8221; border_width_bottom=&#8221;2px&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221; custom_css_toggle_title_last_edited=&#8221;on|desktop&#8221; title_text_color__hover_enabled=&#8221;off|hover&#8221; title_text_color__hover=&#8221;#000000&#8243; sticky_enabled=&#8221;0&#8243;]<ul>\n<li><span class=\"tadv-color\">In compiling the corpus, reasonable efforts were taken to ensure that the text of each article and chapter are accurate reflections of the published texts. Some details to be aware of:<\/span>\n<ul>\n<li><span class=\"tadv-color\"><span class=\"tadv-background-color\"><strong>spellings<span>\u00a0<\/span><\/strong>were not standardized\u2013British and American variants remain as they were in the original articles, and any non-standard or infrequently used spellings (or misspellings) of words were maintained; thus, if you want to retrieve examples of both \u201cneighborhood\u201d and \u201cneighbourhood\u201d you will need to specify both forms in your search using parentheses and the alternative symbol | so that the search is typed as (neighborhood|neighbourhood). The corpus is lemmatized and lemma searches can be incorporated into alternates so that ({neighborhood}|{neighbourhood}) will retrieve both the singular and plural of both spelling variants.<\/span><\/span><\/li>\n<li><span class=\"tadv-color\">we attempted to remove all<span>\u00a0<\/span><strong>hyphens<span>\u00a0<\/span><\/strong>in the original text<span>\u00a0<\/span><strong>that were used when a word was divided at the end of a line of printed text<\/strong><span>\u00a0<\/span>so that in the corpus the word appears as a whole (and can be found in searches); however, some of these divided words might have gone undetected, and some intentional hyphens (in compound words that have optional hyphens, and in long URLs) may have been deleted in the process. These minor inconsistencies should not be statistically significant or detract from the usability of the corpus; however, if you notice mistakes, please let us know so that we can fix them in a future version.<\/span><\/li>\n<li><span class=\"tadv-color\">as stated above, whenever possible the text in<span>\u00a0<\/span><strong>tables, charts, and figures<\/strong><span>\u00a0<\/span>was maintained though not in the original format; however, when the words in illustrations were part of image files (not printed text) in the originals, the words could not be included in the corpus files.<\/span><\/li>\n<li><span class=\"tadv-color\"><strong>images\/figures<\/strong><span>\u00a0<\/span>in original files and any linguistic items in the images are not included in the corpus files, but every attempt was made to include the labels and captions for all of these items so that they can be found in searches. As stated above, text in languages other than English, especially those that are not written in Latin\/Roman script, will vary greatly in accuracy and completeness in the corpus. Furthermore, all part of speech tagging and lemmatization is based on English structure. Creating a functioning multilingual corpus is beyond the scope of this project, so please rely only on the English text for search and analysis of the corpus while keeping in mind that some foreign words may have been automatically tagged with English part of speech tags.<\/span><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n[\/et_pb_toggle][et_pb_toggle title=&#8221;What is the purpose of the LL corpus?&#8221; use_icon_font_size=&#8221;on&#8221; icon_font_size=&#8221;30px&#8221; open_use_icon_font_size=&#8221;on&#8221; open_icon_font_size=&#8221;30px&#8221; admin_label=&#8221;Why doesn\u2019t the ECS degree come with a teaching license?&#8221; module_class=&#8221;aks-faq&#8221; _builder_version=&#8221;4.21.0&#8243; title_level=&#8221;h3&#8243; custom_margin=&#8221;0px||0px||false|false&#8221; animation_style=&#8221;slide&#8221; animation_direction=&#8221;left&#8221; animation_duration=&#8221;750ms&#8221; animation_intensity_slide=&#8221;5%&#8221; hover_enabled=&#8221;0&#8243; custom_css_main_element=&#8221;background-color: transparent;&#8221; border_width_all=&#8221;0px&#8221; border_width_bottom=&#8221;2px&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221; custom_css_toggle_title_last_edited=&#8221;on|desktop&#8221; title_text_color__hover_enabled=&#8221;off|hover&#8221; title_text_color__hover=&#8221;#000000&#8243; sticky_enabled=&#8221;0&#8243;]<ul>\n<li><span class=\"tadv-color\">The primary aim of the LL Corpus is to enable LL scholars to find more detailed and accurate information from previous studies than is available from the LL Bibliography or from large academic databases that include publications that are not related to the field of Linguistic Landscape Studies.<\/span><\/li>\n<li><span class=\"tadv-color\">Another aim is to encourage the democratization of LL research by making the work of lesser-cited authors just as accessible as that of more frequently cited scholars. When users perform a search for words or phrases they are interested in, they will obtain results from any and all publications in the corpus, and it is our hope that this leads scholars to publications they would not otherwise have discovered.<\/span><\/li>\n<li><span class=\"tadv-color\">Because the LL Corpus contains metadata categories for year of publication, the corpus can be used to explore historical developments in the field of LL Studies. Similarly, the metadata category of publication type (article or book chapter) allows for comparisons between these two major publishing venues. From a corpus linguistics perspective, it is rarely feasible to create a specialized, discipline-specific corpus of publications that is highly representative of an academic field (See the following section for representativeness). Because the LL Corpus is hosted on the CQPweb server, users can perform genre studies\u2014for example, keyword analysis of the LL Corpus in comparison to the British National Corpus or to any of the other publicly available corpora that are also on CQPweb.<\/span><\/li>\n<\/ul>\n[\/et_pb_toggle][et_pb_toggle title=&#8221;How representative is the corpus?&#8221; use_icon_font_size=&#8221;on&#8221; icon_font_size=&#8221;30px&#8221; open_use_icon_font_size=&#8221;on&#8221; open_icon_font_size=&#8221;30px&#8221; admin_label=&#8221;Why doesn\u2019t the ECS degree come with a teaching license?&#8221; module_class=&#8221;aks-faq&#8221; _builder_version=&#8221;4.21.0&#8243; title_level=&#8221;h3&#8243; custom_margin=&#8221;0px||0px||false|false&#8221; animation_style=&#8221;slide&#8221; animation_direction=&#8221;left&#8221; animation_duration=&#8221;750ms&#8221; animation_intensity_slide=&#8221;5%&#8221; hover_enabled=&#8221;0&#8243; custom_css_main_element=&#8221;background-color: transparent;&#8221; border_width_all=&#8221;0px&#8221; border_width_bottom=&#8221;2px&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221; custom_css_toggle_title_last_edited=&#8221;on|desktop&#8221; title_text_color__hover_enabled=&#8221;off|hover&#8221; title_text_color__hover=&#8221;#000000&#8243; sticky_enabled=&#8221;0&#8243;]<ul>\n<li><span class=\"tadv-color\">The<span>\u00a0<\/span><a rel=\"noopener noreferrer\" href=\"https:\/\/www.zotero.org\/groups\/216092\/linguistic_landscape_bibliography?\" data-type=\"URL\" data-id=\"https:\/\/www.zotero.org\/groups\/216092\/linguistic_landscape_bibliography?\" target=\"_blank\">LL Bibliography<\/a><span>\u00a0<\/span>on Zotero contains complete reference information for 1115 items (books, book chapters, journal articles, dissertations and theses, reports, and the annual LL Workshops). The LL Bibliography lists 427 journal articles from 1997-2017 and 247 book chapters from 2006-2018\u2013the LL Corpus contains the full text of 383 of these articles (90%) and 165 chapters (67%) respectively from those years; thus, the LL Corpus contains approximately 80% of LL publications in journals and books during the respective periods. It is worth noting that the 10% of journal articles not included in the corpus are typically ones that were very difficult to access while the 33% of book chapters that were not included were present in a very wide variety of volumes the whole of which were not focused on LL Studies. On the other hand, the complete texts of the following edited collections of LL work are included in the LL Corpus.<\/span>\n<ul>\n<li><span class=\"tadv-color\"><em>Linguistic Landscape: Expanding the Scenery<\/em>. 2009<\/span><\/li>\n<li><span class=\"tadv-color\"><em>Linguistic Landscape in the City<\/em>. 2010<\/span><\/li>\n<li><span class=\"tadv-color\"><em>Semiotic Landscapes: Language, Image, Space<\/em>. 2010<\/span><\/li>\n<li><span class=\"tadv-color\"><em>Linguistic Landscapes, Multilingualism and Social Change<\/em>. 2012<\/span><\/li>\n<li><span class=\"tadv-color\"><em>Minority Languages in the Linguistic Landscape<\/em>. 2012<\/span><\/li>\n<li><span class=\"tadv-color\"><em>Conflict, Exclusion and Dissent in the Linguistic Landscape<\/em>. 2015<\/span><\/li>\n<li><span class=\"tadv-color\"><em>Negotiating and Contesting Identities in Linguistic Landscapes<\/em>. 2016<\/span><\/li>\n<li><span class=\"tadv-color\"><em>Expanding the Linguistic Landscape<\/em>. 2018<\/span><\/li>\n<\/ul>\n<\/li>\n<li><span class=\"tadv-color\">A complete list of the metadata for each item in the LL Corpus is available<span>\u00a0<\/span><strong><a href=\"https:\/\/wou.edu\/linguistic-landscape\/files\/2021\/08\/LL_Corpus_Metadata_2021.xlsx\" target=\"_blank\" rel=\"noopener\">as an Excel file<\/a><\/strong><span>\u00a0<\/span>as well as<span>\u00a0<\/span><strong><a href=\"https:\/\/wou.edu\/linguistic-landscape\/files\/2021\/08\/LL_Corpus_items_2021.pdf\" target=\"_blank\" rel=\"alternate noopener\">in a pdf<\/a><span>\u00a0<\/span><\/strong>organized alphabetically by author\u2019s last name.<\/span><\/li>\n<\/ul>\n[\/et_pb_toggle][et_pb_toggle title=&#8221;How do I access the LL Corpus?&#8221; use_icon_font_size=&#8221;on&#8221; icon_font_size=&#8221;30px&#8221; open_use_icon_font_size=&#8221;on&#8221; open_icon_font_size=&#8221;30px&#8221; admin_label=&#8221;Why doesn\u2019t the ECS degree come with a teaching license?&#8221; module_class=&#8221;aks-faq&#8221; _builder_version=&#8221;4.21.0&#8243; title_level=&#8221;h3&#8243; custom_margin=&#8221;0px||0px||false|false&#8221; animation_style=&#8221;slide&#8221; animation_direction=&#8221;left&#8221; animation_duration=&#8221;750ms&#8221; animation_intensity_slide=&#8221;5%&#8221; hover_enabled=&#8221;0&#8243; custom_css_main_element=&#8221;background-color: transparent;&#8221; border_width_all=&#8221;0px&#8221; border_width_bottom=&#8221;2px&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221; custom_css_toggle_title_last_edited=&#8221;on|desktop&#8221; title_text_color__hover_enabled=&#8221;off|hover&#8221; title_text_color__hover=&#8221;#000000&#8243; sticky_enabled=&#8221;0&#8243;]<ul>\n<li><span class=\"tadv-color\">The LL Corpus is only available through CQPweb. See the<span>\u00a0<\/span><a href=\"https:\/\/wou.edu\/linguistic-landscape\/sign-up-for-access\/\" data-type=\"page\" data-id=\"768\">Sign Up For Access<\/a><span>\u00a0<\/span>page.<\/span><\/li>\n<\/ul>\n[\/et_pb_toggle][et_pb_toggle title=&#8221;How do I search and analyze the LL Corpus?&#8221; use_icon_font_size=&#8221;on&#8221; icon_font_size=&#8221;30px&#8221; open_use_icon_font_size=&#8221;on&#8221; open_icon_font_size=&#8221;30px&#8221; admin_label=&#8221;Why doesn\u2019t the ECS degree come with a teaching license?&#8221; module_class=&#8221;aks-faq&#8221; _builder_version=&#8221;4.21.0&#8243; title_level=&#8221;h3&#8243; custom_margin=&#8221;0px||0px||false|false&#8221; animation_style=&#8221;slide&#8221; animation_direction=&#8221;left&#8221; animation_duration=&#8221;750ms&#8221; animation_intensity_slide=&#8221;5%&#8221; hover_enabled=&#8221;0&#8243; custom_css_main_element=&#8221;background-color: transparent;&#8221; border_width_all=&#8221;0px&#8221; border_width_bottom=&#8221;2px&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221; custom_css_toggle_title_last_edited=&#8221;on|desktop&#8221; title_text_color__hover_enabled=&#8221;off|hover&#8221; title_text_color__hover=&#8221;#000000&#8243; sticky_enabled=&#8221;0&#8243;]<ul>\n<li><span class=\"tadv-color\">The best resource for learning how to search and analyze the corpora that are available on CQPweb is the<span>\u00a0<\/span><a rel=\"noreferrer noopener\" href=\"https:\/\/cqpweb.lancs.ac.uk\/usr\/help.php?ui=hello\" data-type=\"URL\" data-id=\"https:\/\/cqpweb.lancs.ac.uk\/usr\/help.php?ui=hello\" target=\"_blank\">CQPweb Help<\/a><span>\u00a0<\/span>page which you can access from the link here, or from bottom section of the navigation bar to the left of the CQPweb interface.<\/span><\/li>\n<\/ul>\n<figure class=\"wp-block-image size-medium is-resized\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/wou.edu\/linguistic-landscape-corpus\/files\/2021\/08\/cqpweb_screenshot3-300x244.jpg\" alt=\"\" class=\"wp-image-856\" width=\"510\" height=\"414\" srcset=\"https:\/\/wou.edu\/linguistic-landscape\/files\/2021\/08\/cqpweb_screenshot3-300x244.jpg 300w, https:\/\/wou.edu\/linguistic-landscape\/files\/2021\/08\/cqpweb_screenshot3-768x625.jpg 768w, https:\/\/wou.edu\/linguistic-landscape\/files\/2021\/08\/cqpweb_screenshot3.jpg 939w\" sizes=\"(max-width: 510px) 100vw, 510px\" \/><\/figure>\n<ul>\n<li><span class=\"tadv-color\">The CQPweb Help system is composed of a series of user-friendly<span>\u00a0<\/span><a href=\"https:\/\/www.youtube.com\/playlist?list=PL2XtJIhhrHNQgf4Dp6sckGZRU4NiUVw1e\" data-type=\"URL\" data-id=\"https:\/\/www.youtube.com\/playlist?list=PL2XtJIhhrHNQgf4Dp6sckGZRU4NiUVw1e\" target=\"_blank\" rel=\"noreferrer noopener\">YouTube tutorials<\/a><span>\u00a0<\/span>that explain how to perform everything from the most basic searches to more advanced corpus linguistic methods. The YouTube tutorials can be reached from the Help system page or directly from the \u201cVideo tutorials\u201d link or the links here.<\/span><\/li>\n<\/ul>\n[\/et_pb_toggle][et_pb_toggle title=&#8221;Where is the LL Corpus hosted?&#8221; use_icon_font_size=&#8221;on&#8221; icon_font_size=&#8221;30px&#8221; open_use_icon_font_size=&#8221;on&#8221; open_icon_font_size=&#8221;30px&#8221; admin_label=&#8221;Why doesn\u2019t the ECS degree come with a teaching license?&#8221; module_class=&#8221;aks-faq&#8221; _builder_version=&#8221;4.21.0&#8243; title_level=&#8221;h3&#8243; custom_margin=&#8221;0px||0px||false|false&#8221; animation_style=&#8221;slide&#8221; animation_direction=&#8221;left&#8221; animation_duration=&#8221;750ms&#8221; animation_intensity_slide=&#8221;5%&#8221; hover_enabled=&#8221;0&#8243; custom_css_main_element=&#8221;background-color: transparent;&#8221; border_width_all=&#8221;0px&#8221; border_width_bottom=&#8221;2px&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221; custom_css_toggle_title_last_edited=&#8221;on|desktop&#8221; title_text_color__hover_enabled=&#8221;off|hover&#8221; title_text_color__hover=&#8221;#000000&#8243; sticky_enabled=&#8221;0&#8243;]<ul>\n<li><span class=\"tadv-color\">The Linguistic Landscape Corpus is generously hosted on the CQP Web server at Lancaster University.<span>\u00a0<\/span><a href=\"https:\/\/cqpweb.lancs.ac.uk\/\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/cqpweb.lancs.ac.uk\/<\/a><\/span><\/li>\n<li><span class=\"tadv-color\">You can cite CQPweb as follows:<\/span>\n<ul>\n<li><span class=\"tadv-color\">Hardie, A (2012) CQPweb \u2013 combining power, flexibility and usability in a corpus analysis tool.<span>\u00a0<\/span><em>International Journal of Corpus Linguistics<\/em><span>\u00a0<\/span>17 (3): 380\u2013409. [<a href=\"https:\/\/doi.org\/10.1075\/ijcl.17.3.04har\" target=\"_blank\" rel=\"noreferrer noopener\">DOI to Full text on publisher\u2019s website<\/a>]\u00a0 [<a href=\"http:\/\/www.lancaster.ac.uk\/staff\/hardiea\/cqpweb-paper.pdf\" target=\"_blank\" rel=\"noreferrer noopener\">Alternative source for PDF<\/a>]<\/span><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n[\/et_pb_toggle][et_pb_toggle title=&#8221;How was the corpus created?&#8221; use_icon_font_size=&#8221;on&#8221; icon_font_size=&#8221;30px&#8221; open_use_icon_font_size=&#8221;on&#8221; open_icon_font_size=&#8221;30px&#8221; admin_label=&#8221;Why doesn\u2019t the ECS degree come with a teaching license?&#8221; module_class=&#8221;aks-faq&#8221; _builder_version=&#8221;4.21.0&#8243; title_level=&#8221;h3&#8243; custom_margin=&#8221;0px||0px||false|false&#8221; animation_style=&#8221;slide&#8221; animation_direction=&#8221;left&#8221; animation_duration=&#8221;750ms&#8221; animation_intensity_slide=&#8221;5%&#8221; hover_enabled=&#8221;0&#8243; custom_css_main_element=&#8221;background-color: transparent;&#8221; border_width_all=&#8221;0px&#8221; border_width_bottom=&#8221;2px&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221; custom_css_toggle_title_last_edited=&#8221;on|desktop&#8221; title_text_color__hover_enabled=&#8221;off|hover&#8221; title_text_color__hover=&#8221;#000000&#8243; sticky_enabled=&#8221;0&#8243;]<ul>\n<li><span class=\"tadv-color\">At the conceptual level, around 2016 we began creating a version of the LL Corpus which contained journal articles and book chapters from 1997 through 2017 for a presentation at the 10<sup>th<\/sup><span>\u00a0<\/span>Linguistic Landscape Workshop (LLX in Bern, Switzerland).<\/span>\n<ul>\n<li><span class=\"tadv-color\">Troyer, R. (May 2018). 20 Years of Linguistic Landscape Studies: A Corpus Analysis of Publications. Presentation at the 10th annual Linguistic Landscapes Workshop. Bern, Switzerland.<\/span><\/li>\n<\/ul>\n<\/li>\n<li><span class=\"tadv-color\">That corpus served its purpose well, but we felt the best path forward would be to create a version that could be accessible to LL scholars. Many of the 357 articles and chapters in the first corpus had been modified so that variant spellings (i.e., American vs. British) were standardized and some elements of formatting that were not relevant for individual research use were inconsistent. The current LL Corpus of 548 items, however, has been created as an accurate (within the parameters discussed in this Manual) representation of the items with as much of the text of articles reproduced as possible with consistent formatting and extensive metadata that includes abstracts and web links to the publications.<\/span><\/li>\n<li><span class=\"tadv-color\">As for the nuts and bolts of corpus organization and creation, we maintain a master database of LL publications. All of these publications are fully referenced on the LL Bibliography on Zotero. We make every effort to obtain electronic full text of each of these items. When obtained (nearly always in pdf form), we use software to convert from pdf format to plain text.<span>\u00a0<\/span><\/span><\/li>\n<\/ul>\n[\/et_pb_toggle][et_pb_toggle title=&#8221;What were the challenges in creating the LL Corpus?&#8221; use_icon_font_size=&#8221;on&#8221; icon_font_size=&#8221;30px&#8221; open_use_icon_font_size=&#8221;on&#8221; open_icon_font_size=&#8221;30px&#8221; admin_label=&#8221;Why doesn\u2019t the ECS degree come with a teaching license?&#8221; module_class=&#8221;aks-faq&#8221; _builder_version=&#8221;4.21.0&#8243; title_level=&#8221;h3&#8243; custom_margin=&#8221;0px||0px||false|false&#8221; animation_style=&#8221;slide&#8221; animation_direction=&#8221;left&#8221; animation_duration=&#8221;750ms&#8221; animation_intensity_slide=&#8221;5%&#8221; hover_enabled=&#8221;0&#8243; custom_css_main_element=&#8221;background-color: transparent;&#8221; border_width_all=&#8221;0px&#8221; border_width_bottom=&#8221;2px&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221; custom_css_toggle_title_last_edited=&#8221;on|desktop&#8221; title_text_color__hover_enabled=&#8221;off|hover&#8221; title_text_color__hover=&#8221;#000000&#8243; sticky_enabled=&#8221;0&#8243;]<ul>\n<li><span class=\"tadv-color\">Journal articles published prior to around 2013 were often in pdf formats that required extensive manual editing to obtain a accurate text versions, and even those published through 2017 were often problematic to convert accurately. The good news is that with newer conversion software and newer pdfs, adding journal articles from 2018 onward should be much faster and easier.<\/span><\/li>\n<li><span class=\"tadv-color\">The frequency with which LL publications contain tables, charts, figures, images and text in non-English languages and non-Roman script makes rendering clean, corpus-searchable files complicated with decisions needing to made about what to include and exclude and how much time can be allotted to manual editing.<\/span><\/li>\n<li><span class=\"tadv-color\">Obtaining electronic versions of book chapters is very difficult. In the cases of the specifically LL-themed collections listed above, this was not challenging, and we thank the editors of several of the above volumes for sharing the full texts of chapters with us. However, edited collections that are devoted to other or broader linguistic or sociological topics but that contain a chapter focused on the LL are very challenging to acquire and convert to text format\u2014thus, only around 67% of the chapters listed in the LL Bibliography were able to be included in the LL Corpus. Growth of the LL Studies. While we are reasonably certain that the LL Bibliography and LL Corpus up to 2018 is as comprehensive as possible, the maturation of the field has spawned so many publications in varied sources that maintaining a \u2018comprehensive\u2019 collection of LL publications may not be tenable after 2020 unless significantly more funding is available. Whether the LL Bibliography and LL Corpus can be continually updated or if they eventually become an archive of the first two decades of LL scholarship, we are confident that they provide the most inclusive repository of LL work that exists and we are committed to maintaining these resources in freely available online formats.<\/span><\/li>\n<\/ul>\n[\/et_pb_toggle][et_pb_toggle title=&#8221;Is the LL Corpus able to be downloaded by users?&#8221; use_icon_font_size=&#8221;on&#8221; icon_font_size=&#8221;30px&#8221; open_use_icon_font_size=&#8221;on&#8221; open_icon_font_size=&#8221;30px&#8221; admin_label=&#8221;Why doesn\u2019t the ECS degree come with a teaching license?&#8221; module_class=&#8221;aks-faq&#8221; _builder_version=&#8221;4.21.0&#8243; title_level=&#8221;h3&#8243; custom_margin=&#8221;0px||0px||false|false&#8221; animation_style=&#8221;slide&#8221; animation_direction=&#8221;left&#8221; animation_duration=&#8221;750ms&#8221; animation_intensity_slide=&#8221;5%&#8221; hover_enabled=&#8221;0&#8243; custom_css_main_element=&#8221;background-color: transparent;&#8221; border_width_all=&#8221;0px&#8221; border_width_bottom=&#8221;2px&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221; custom_css_toggle_title_last_edited=&#8221;on|desktop&#8221; title_text_color__hover_enabled=&#8221;off|hover&#8221; title_text_color__hover=&#8221;#000000&#8243; sticky_enabled=&#8221;0&#8243;]<ul>\n<li><span class=\"tadv-color\">Due to copyright restrictions on most of what is in the LL Corpus, the actual corpus files<span>\u00a0<\/span><strong>are not publicly available<\/strong>, and access to the corpus is limited to the 200-word window around the node word of search results in the CQPweb interface. However, if researchers are interested in collaborating on a project that would necessitate use of the entire corpus outside of CQPweb, please send inquires to Rob Troyer (see<span>\u00a0<\/span><a href=\"https:\/\/wou.edu\/linguistic-landscape\/contact-us\/\" data-type=\"page\" data-id=\"773\">Contacts<span>\u00a0<\/span><\/a>page).<\/span><\/li>\n<\/ul>\n[\/et_pb_toggle][et_pb_toggle title=&#8221;Will there be a future version?&#8221; use_icon_font_size=&#8221;on&#8221; icon_font_size=&#8221;30px&#8221; open_use_icon_font_size=&#8221;on&#8221; open_icon_font_size=&#8221;30px&#8221; admin_label=&#8221;Why doesn\u2019t the ECS degree come with a teaching license?&#8221; module_class=&#8221;aks-faq&#8221; _builder_version=&#8221;4.21.0&#8243; title_level=&#8221;h3&#8243; custom_margin=&#8221;0px||0px||false|false&#8221; animation_style=&#8221;slide&#8221; animation_direction=&#8221;left&#8221; animation_duration=&#8221;750ms&#8221; animation_intensity_slide=&#8221;5%&#8221; hover_enabled=&#8221;0&#8243; custom_css_main_element=&#8221;background-color: transparent;&#8221; border_width_all=&#8221;0px&#8221; border_width_bottom=&#8221;2px&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221; custom_css_toggle_title_last_edited=&#8221;on|desktop&#8221; title_text_color__hover_enabled=&#8221;off|hover&#8221; title_text_color__hover=&#8221;#000000&#8243; sticky_enabled=&#8221;0&#8243;]<ul>\n<li><span class=\"tadv-color\">If there is enough user interest, future additions to the LL Corpus can include as many as possible of the 53 PhD Dissertations and Masters Theses that are referenced in the LL Bibliography on Zotero. Likewise, we possess pdfs of the vast majority of journal articles and book chapters published from 2017 through 2020, and following conversion and editing, these could be added to a future version of the LL Corpus.<\/span><\/li>\n<li><span class=\"tadv-color\">If you use the LL Corpus, please let us know (see the<span>\u00a0<\/span><a href=\"https:\/\/wou.edu\/linguistic-landscape\/contact-us\/\" data-type=\"page\" data-id=\"773\">Contacts<span>\u00a0<\/span><\/a>page) so that we can make a case for additional funding.<\/span><\/li>\n<\/ul>\n[\/et_pb_toggle][et_pb_toggle title=&#8221;How was the creation of the LL Corpus funded?&#8221; use_icon_font_size=&#8221;on&#8221; icon_font_size=&#8221;30px&#8221; open_use_icon_font_size=&#8221;on&#8221; open_icon_font_size=&#8221;30px&#8221; admin_label=&#8221;Why doesn\u2019t the ECS degree come with a teaching license?&#8221; module_class=&#8221;aks-faq&#8221; _builder_version=&#8221;4.21.0&#8243; title_level=&#8221;h3&#8243; custom_margin=&#8221;0px||0px||false|false&#8221; animation_style=&#8221;slide&#8221; animation_direction=&#8221;left&#8221; animation_duration=&#8221;750ms&#8221; animation_intensity_slide=&#8221;5%&#8221; hover_enabled=&#8221;0&#8243; custom_css_main_element=&#8221;background-color: transparent;&#8221; border_width_all=&#8221;0px&#8221; border_width_bottom=&#8221;2px&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221; custom_css_toggle_title_last_edited=&#8221;on|desktop&#8221; title_text_color__hover_enabled=&#8221;off|hover&#8221; title_text_color__hover=&#8221;#000000&#8243; sticky_enabled=&#8221;0&#8243;]<ul>\n<li><span class=\"tadv-color\">Creation of the corpus, directed by<span>\u00a0<\/span><a href=\"https:\/\/wou.edu\/resources\/faculty-staff-info\/?u=troyerr\" data-type=\"URL\" data-id=\"https:\/\/wou.edu\/resources\/faculty-staff-info\/?u=troyerr\" target=\"_blank\" rel=\"noreferrer noopener\">Rob Troyer<\/a>, was enabled by funding for undergraduate research assistants provided by Western Oregon University\u2019s Community Internship Program as well as Faculty Development Funding for a Major Projects Research Grant as well as Funding for travel to Linguistic Landscape Workshops and institutional subscriptions to journals and purchases of individual publications.<\/span><\/li>\n<\/ul>\n[\/et_pb_toggle][\/et_pb_column][\/et_pb_row][\/et_pb_section]\n","protected":false},"excerpt":{"rendered":"<p>LL Corpus FAQ What is the LL Corpus? The LL Corpus is&#8230;<\/p>\n","protected":false},"author":417,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_seopress_robots_primary_cat":"","_seopress_titles_title":"LL Corpus FAQ | Linguistic Landscape Resources","_seopress_titles_desc":"","_seopress_robots_index":"","_lmt_disableupdate":"no","_lmt_disable":"","_et_pb_use_builder":"on","_et_pb_old_content":"<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\">What is the LL Corpus?<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">How do I cite the LL Corpus?<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">What details about the LL Corpus should I be aware of?<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">What is the purpose of the LL Corpus?<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">How representative is the Corpus?<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">How do I access the LL Corpus?<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">How do I search and analyze the LL Corpus?<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">Where is the LL Corpus hosted?<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">How was the LL Corpus created?<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">What were the challenges in creating the LL Corpus?<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">Is the LL Corpus able to be downloaded by users?<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">Will there be a future version?<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">How was the creation of the LL Corpus funded?<\/span><\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:heading {\"level\":4} -->\n<h4>What is the LL Corpus?<\/h4>\n<!-- \/wp:heading -->\n\n<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\">The LL Corpus is the full text of 383 published journal articles (1997-2017) and 165 book chapters (2008-2018) for a total of 548 items. The LL Corpus is a freely available resource intended for use by Linguistic Landscape scholars and students as well as corpus linguists. The CQPweb search interface enables users to perform anything from simple word searches to advanced corpus analysis. The LL Corpus does not infringe on copyright restrictions because the results of searches only display 200-words of text from any item. Links in the metadata for each item will direct users to the DOI or URL for the publication so that users can access the full text via institutional or individual methods.<\/span><\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\"><strong>The LL Corpus is not<\/strong> a substitute for the original articles and chapters that it contains.<\/span><ul><li><span style=\"color:#000000\" class=\"tadv-color\"><strong>Tables, charts, figures, numerals and non-English text will be formatted differently, removed, or reproduced inaccurately<\/strong> due to the conversion process from published text to corpus-searchable plain text; however, every reasonable effort was taken to make the English text as accurate as possible. <\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">Many of the works contained in the LL Corpus are under <strong>copyright <\/strong>and\/or not freely available. For this reason, users can only see 100 words to the left and 100 words to the right of a search term\/phrase of each text\u2014the full texts are not available. However, the links provided in the metadata should take you stable webpages where you can see either the full text or information on how to purchase or find institutional access to items.<\/span><\/li><\/ul><\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:heading {\"level\":4} -->\n<h4>How do I cite the LL Corpus? (APA)<\/h4>\n<!-- \/wp:heading -->\n\n<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\">Troyer, Robert A. (2021). Linguistic Landscape Corpus. CQPweb at Lancaster. https:\/\/cqpweb.lancs.ac.uk\/llscape202107\/<\/span><\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:heading {\"level\":4} -->\n<h4>What details about the LL Corpus should I be aware of?<\/h4>\n<!-- \/wp:heading -->\n\n<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\">In compiling the corpus, reasonable efforts were taken to ensure that the text of each article and chapter are accurate reflections of the published texts. Some details to be aware of: <\/span><ul><li><span style=\"color:#000000\" class=\"tadv-color\"><span style=\"background-color:#ffffff\" class=\"tadv-background-color\"><strong>spellings <\/strong>were not standardized--British and American variants remain as they were in the original articles, and any non-standard or infrequently used spellings (or misspellings) of words were maintained; thus, if you want to retrieve examples of both \u201cneighborhood\u201d and \u201cneighbourhood\u201d you will need to specify both forms in your search using parentheses and the alternative symbol | so that the search is typed as (neighborhood|neighbourhood). The corpus is lemmatized and lemma searches can be incorporated into alternates so that ({neighborhood}|{neighbourhood}) will retrieve both the singular and plural of both spelling variants.<\/span><\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">we attempted to remove all <strong>hyphens <\/strong>in the original text <strong>that were used when a word was divided at the end of a line of printed text<\/strong> so that in the corpus the word appears as a whole (and can be found in searches); however, some of these divided words might have gone undetected, and some intentional hyphens (in compound words that have optional hyphens, and in long URLs) may have been deleted in the process. These minor inconsistencies should not be statistically significant or detract from the usability of the corpus; however, if you notice mistakes, please let us know so that we can fix them in a future version.<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">as stated above, whenever possible the text in <strong>tables, charts, and figures<\/strong> was maintained though not in the original format; however, when the words in illustrations were part of image files (not printed text) in the originals, the words could not be included in the corpus files.<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\"><strong>images\/figures<\/strong> in original files and any linguistic items in the images are not included in the corpus files, but every attempt was made to include the labels and captions for all of these items so that they can be found in searches. As stated above, text in languages other than English, especially those that are not written in Latin\/Roman script, will vary greatly in accuracy and completeness in the corpus. Furthermore, all part of speech tagging and lemmatization is based on English structure. Creating a functioning multilingual corpus is beyond the scope of this project, so please rely only on the English text for search and analysis of the corpus while keeping in mind that some foreign words may have been automatically tagged with English part of speech tags.<\/span><\/li><\/ul><\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:heading {\"level\":4} -->\n<h4>What is the purpose of the LL corpus?<\/h4>\n<!-- \/wp:heading -->\n\n<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\">The primary aim of the LL Corpus is to enable LL scholars to find more detailed and accurate information from previous studies than is available from the LL Bibliography or from large academic databases that include publications that are not related to the field of Linguistic Landscape Studies.<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">Another aim is to encourage the democratization of LL research by making the work of lesser-cited authors just as accessible as that of more frequently cited scholars. When users perform a search for words or phrases they are interested in, they will obtain results from any and all publications in the corpus, and it is our hope that this leads scholars to publications they would not otherwise have discovered.<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">Because the LL Corpus contains metadata categories for year of publication, the corpus can be used to explore historical developments in the field of LL Studies. Similarly, the metadata category of publication type (article or book chapter) allows for comparisons between these two major publishing venues. From a corpus linguistics perspective, it is rarely feasible to create a specialized, discipline-specific corpus of publications that is highly representative of an academic field (See the following section for representativeness). Because the LL Corpus is hosted on the CQPweb server, users can perform genre studies\u2014for example, keyword analysis of the LL Corpus in comparison to the British National Corpus or to any of the other publicly available corpora that are also on CQPweb.<\/span><\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:heading {\"level\":4} -->\n<h4>How representative is the corpus?<\/h4>\n<!-- \/wp:heading -->\n\n<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\">The <a rel=\"noreferrer noopener\" href=\"https:\/\/www.zotero.org\/groups\/216092\/linguistic_landscape_bibliography?\" data-type=\"URL\" data-id=\"https:\/\/www.zotero.org\/groups\/216092\/linguistic_landscape_bibliography?\" target=\"_blank\">LL Bibliography<\/a> on Zotero contains complete reference information for 1115 items (books, book chapters, journal articles, dissertations and theses, reports, and the annual LL Workshops). The LL Bibliography lists 427 journal articles from 1997-2017 and 247 book chapters from 2006-2018--the LL Corpus contains the full text of 383 of these articles (90%) and 165 chapters (67%) respectively from those years; thus, the LL Corpus contains approximately 80% of LL publications in journals and books during the respective periods. It is worth noting that the 10% of journal articles not included in the corpus are typically ones that were very difficult to access while the 33% of book chapters that were not included were present in a very wide variety of volumes the whole of which were not focused on LL Studies. On the other hand, the complete texts of the following edited collections of LL work are included in the LL Corpus.<\/span><ul><li><span style=\"color:#000000\" class=\"tadv-color\"><em>Linguistic Landscape: Expanding the Scenery<\/em>. 2009<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\"><em>Linguistic Landscape in the City<\/em>. 2010<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\"><em>Semiotic Landscapes: Language, Image, Space<\/em>. 2010<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\"><em>Linguistic Landscapes, Multilingualism and Social Change<\/em>. 2012<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\"><em>Minority Languages in the Linguistic Landscape<\/em>. 2012<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\"><em>Conflict, Exclusion and Dissent in the Linguistic Landscape<\/em>. 2015<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\"><em>Negotiating and Contesting Identities in Linguistic Landscapes<\/em>. 2016<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\"><em>Expanding the Linguistic Landscape<\/em>. 2018<\/span><\/li><\/ul><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">A complete list of the metadata for each item in the LL Corpus is available <strong><a rel=\"noreferrer noopener\" href=\"http:\/\/LL_Corpus_Metadata_2021.xlsx\" data-type=\"URL\" data-id=\"LL_Corpus_Metadata_2021.xlsx\" target=\"_blank\">as an Excel file<\/a><\/strong> as well as <strong><a rel=\"noreferrer noopener\" href=\"http:\/\/LL_Corpus_items_2021.pdf\" data-type=\"URL\" data-id=\"LL_Corpus_items_2021.pdf\" target=\"_blank\">in a pdf<\/a> <\/strong>organized alphabetically by author's last name. <\/span><\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:heading {\"level\":4} -->\n<h4>How do I access the LL Corpus?<\/h4>\n<!-- \/wp:heading -->\n\n<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\">The LL Corpus is only available through CQPweb. See the <a href=\"https:\/\/wou.edu\/linguistic-landscape-corpus\/sign-up-for-access\/\" data-type=\"page\" data-id=\"768\">Sign Up For Access<\/a> page. <\/span><\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:heading {\"level\":4} -->\n<h4>How do I search and analyze the LL Corpus?<\/h4>\n<!-- \/wp:heading -->\n\n<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\">The best resource for learning how to search and analyze the corpora that are available on CQPweb is the <a rel=\"noreferrer noopener\" href=\"https:\/\/cqpweb.lancs.ac.uk\/usr\/help.php?ui=hello\" data-type=\"URL\" data-id=\"https:\/\/cqpweb.lancs.ac.uk\/usr\/help.php?ui=hello\" target=\"_blank\">CQPweb Help<\/a> page which you can access from the link here, or from bottom section of the navigation bar to the left of the CQPweb interface. <\/span><\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:image {\"id\":856,\"width\":510,\"height\":414,\"sizeSlug\":\"medium\",\"linkDestination\":\"none\"} -->\n<figure class=\"wp-block-image size-medium is-resized\"><img src=\"https:\/\/wou.edu\/linguistic-landscape-corpus\/files\/2021\/08\/cqpweb_screenshot3-300x244.jpg\" alt=\"\" class=\"wp-image-856\" width=\"510\" height=\"414\" \/><\/figure>\n<!-- \/wp:image -->\n\n<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\">The CQPweb Help system is composed of a series of user-friendly <a href=\"https:\/\/www.youtube.com\/playlist?list=PL2XtJIhhrHNQgf4Dp6sckGZRU4NiUVw1e\" data-type=\"URL\" data-id=\"https:\/\/www.youtube.com\/playlist?list=PL2XtJIhhrHNQgf4Dp6sckGZRU4NiUVw1e\" target=\"_blank\" rel=\"noreferrer noopener\">YouTube tutorials<\/a> that explain how to perform everything from the most basic searches to more advanced corpus linguistic methods. The YouTube tutorials can be reached from the Help system page or directly from the \"Video tutorials\" link or the links here.<\/span><\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:heading {\"level\":4} -->\n<h4>Where is the LL Corpus hosted?<\/h4>\n<!-- \/wp:heading -->\n\n<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\">The Linguistic Landscape Corpus is generously hosted on the CQP Web server at Lancaster University. <a href=\"https:\/\/cqpweb.lancs.ac.uk\/\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/cqpweb.lancs.ac.uk\/<\/a><\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">You can cite CQPweb as follows: <\/span><ul><li><span style=\"color:#000000\" class=\"tadv-color\">Hardie, A (2012) CQPweb - combining power, flexibility and usability in a corpus analysis tool. <em>International Journal of Corpus Linguistics<\/em> 17 (3): 380\u2013409. [<a href=\"https:\/\/doi.org\/10.1075\/ijcl.17.3.04har\" target=\"_blank\" rel=\"noreferrer noopener\">DOI to Full text on publisher's website<\/a>]\u00a0 [<a href=\"http:\/\/www.lancaster.ac.uk\/staff\/hardiea\/cqpweb-paper.pdf\" target=\"_blank\" rel=\"noreferrer noopener\">Alternative source for PDF<\/a>]<\/span><\/li><\/ul><\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:heading {\"level\":4} -->\n<h4>How was the corpus created?<\/h4>\n<!-- \/wp:heading -->\n\n<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\">At the conceptual level, around 2016 we began creating a version of the LL Corpus which contained journal articles and book chapters from 1997 through 2017 for a presentation at the 10<sup>th<\/sup> Linguistic Landscape Workshop (LLX in Bern, Switzerland).<\/span><ul><li><span style=\"color:#000000\" class=\"tadv-color\">Troyer, R. (May 2018). 20 Years of Linguistic Landscape Studies: A Corpus Analysis of Publications. Presentation at the 10th annual Linguistic Landscapes Workshop. Bern, Switzerland.<\/span><\/li><\/ul><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">That corpus served its purpose well, but we felt the best path forward would be to create a version that could be accessible to LL scholars. Many of the 357 articles and chapters in the first corpus had been modified so that variant spellings (i.e., American vs. British) were standardized and some elements of formatting that were not relevant for individual research use were inconsistent. The current LL Corpus of 548 items, however, has been created as an accurate (within the parameters discussed in this Manual) representation of the items with as much of the text of articles reproduced as possible with consistent formatting and extensive metadata that includes abstracts and web links to the publications. <\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">As for the nuts and bolts of corpus organization and creation, we maintain a master database of LL publications. All of these publications are fully referenced on the LL Bibliography on Zotero. We make every effort to obtain electronic full text of each of these items. When obtained (nearly always in pdf form), we use software to convert from pdf format to plain text. <\/span>&nbsp;<\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:heading {\"level\":4} -->\n<h4>What were the challenges in creating the LL Corpus?<\/h4>\n<!-- \/wp:heading -->\n\n<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\">Journal articles published prior to around 2013 were often in pdf formats that required extensive manual editing to obtain a accurate text versions, and even those published through 2017 were often problematic to convert accurately. The good news is that with newer conversion software and newer pdfs, adding journal articles from 2018 onward should be much faster and easier.<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">The frequency with which LL publications contain tables, charts, figures, images and text in non-English languages and non-Roman script makes rendering clean, corpus-searchable files complicated with decisions needing to made about what to include and exclude and how much time can be allotted to manual editing.<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">Obtaining electronic versions of book chapters is very difficult. In the cases of the specifically LL-themed collections listed above, this was not challenging, and we thank the editors of several of the above volumes for sharing the full texts of chapters with us. However, edited collections that are devoted to other or broader linguistic or sociological topics but that contain a chapter focused on the LL are very challenging to acquire and convert to text format\u2014thus, only around 67% of the chapters listed in the LL Bibliography were able to be included in the LL Corpus. Growth of the LL Studies. While we are reasonably certain that the LL Bibliography and LL Corpus up to 2018 is as comprehensive as possible, the maturation of the field has spawned so many publications in varied sources that maintaining a \u2018comprehensive\u2019 collection of LL publications may not be tenable after 2020 unless significantly more funding is available. Whether the LL Bibliography and LL Corpus can be continually updated or if they eventually become an archive of the first two decades of LL scholarship, we are confident that they provide the most inclusive repository of LL work that exists and we are committed to maintaining these resources in freely available online formats.<\/span><\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:heading {\"level\":4} -->\n<h4>Is the LL Corpus able to be downloaded by users?<\/h4>\n<!-- \/wp:heading -->\n\n<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\">Due to copyright restrictions on most of what is in the LL Corpus, the actual corpus files <strong>are not publicly available<\/strong>, and access to the corpus is limited to the 200-word window around the node word of search results in the CQPweb interface. However, if researchers are interested in collaborating on a project that would necessitate use of the entire corpus outside of CQPweb, please send inquires to Rob Troyer (see <a href=\"https:\/\/wou.edu\/linguistic-landscape-corpus\/contact-us\/\" data-type=\"page\" data-id=\"773\">Contacts <\/a>page).<\/span><\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:heading {\"level\":4} -->\n<h4>Will there be a future version?<\/h4>\n<!-- \/wp:heading -->\n\n<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\">If there is enough user interest, future additions to the LL Corpus can include as many as possible of the 53 PhD Dissertations and Masters Theses that are referenced in the LL Bibliography on Zotero. Likewise, we possess pdfs of the vast majority of journal articles and book chapters published from 2017 through 2020, and following conversion and editing, these could be added to a future version of the LL Corpus.<\/span><\/li><li><span style=\"color:#000000\" class=\"tadv-color\">If you use the LL Corpus, please let us know (see the <a href=\"https:\/\/wou.edu\/linguistic-landscape-corpus\/contact-us\/\" data-type=\"page\" data-id=\"773\">Contacts <\/a>page) so that we can make a case for additional funding. <\/span><\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:heading {\"level\":4} -->\n<h4>How was the creation of the LL Corpus funded?<\/h4>\n<!-- \/wp:heading -->\n\n<!-- wp:list -->\n<ul><li><span style=\"color:#000000\" class=\"tadv-color\">Creation of the corpus, directed by <a href=\"https:\/\/wou.edu\/resources\/faculty-staff-info\/?u=troyerr\" data-type=\"URL\" data-id=\"https:\/\/wou.edu\/resources\/faculty-staff-info\/?u=troyerr\" target=\"_blank\" rel=\"noreferrer noopener\">Rob Troyer<\/a>, was enabled by funding for undergraduate research assistants provided by Western Oregon University\u2019s Community Internship Program as well as Faculty Development Funding for a Major Projects Research Grant as well as Funding for travel to Linguistic Landscape Workshops and institutional subscriptions to journals and purchases of individual publications.<\/span><\/li><\/ul>\n<!-- \/wp:list -->\n\n<!-- wp:paragraph -->\n<p><\/p>\n<!-- \/wp:paragraph -->","_et_gb_content_width":"","footnotes":"","_links_to":"","_links_to_target":""},"class_list":["post-770","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/wou.edu\/linguistic-landscape\/wp-json\/wp\/v2\/pages\/770","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wou.edu\/linguistic-landscape\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/wou.edu\/linguistic-landscape\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/wou.edu\/linguistic-landscape\/wp-json\/wp\/v2\/users\/417"}],"replies":[{"embeddable":true,"href":"https:\/\/wou.edu\/linguistic-landscape\/wp-json\/wp\/v2\/comments?post=770"}],"version-history":[{"count":1,"href":"https:\/\/wou.edu\/linguistic-landscape\/wp-json\/wp\/v2\/pages\/770\/revisions"}],"predecessor-version":[{"id":1107,"href":"https:\/\/wou.edu\/linguistic-landscape\/wp-json\/wp\/v2\/pages\/770\/revisions\/1107"}],"wp:attachment":[{"href":"https:\/\/wou.edu\/linguistic-landscape\/wp-json\/wp\/v2\/media?parent=770"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}