* using log directory ‘/data/gannet/ripley/R/packages/tests-Suggests/tm.Rcheck’
* using R Under development (unstable) (2024-04-20 r86457)
* using platform: x86_64-pc-linux-gnu
* R was compiled by
    gcc-13 (GCC) 13.2.0
    GNU Fortran (GCC) 13.2.0
* running under: Fedora Linux 36 (Workstation Edition)
* using session charset: UTF-8
* using option ‘--no-stop-on-test-error’
* checking for file ‘tm/DESCRIPTION’ ... OK
* this is package ‘tm’ version ‘0.7-13’
* checking package namespace information ... OK
* checking package dependencies ... NOTE
Packages suggested but not available for checking:
  'Rcampdf', 'tm.lexicon.GeneralInquirer'
* checking if this is a source package ... OK
* checking if there is a namespace ... OK
* checking for executable files ... OK
* checking for hidden files and directories ... OK
* checking for portable file names ... OK
* checking for sufficient/correct file permissions ... OK
* checking whether package ‘tm’ can be installed ... [55s/28s] OK
* used C compiler: ‘gcc-13 (GCC) 13.2.0’
* used C++ compiler: ‘g++-13 (GCC) 13.2.0’
* checking package directory ... OK
* checking ‘build’ directory ... OK
* checking DESCRIPTION meta-information ... OK
* checking top-level files ... OK
* checking for left-over files ... OK
* checking index information ... OK
* checking package subdirectories ... OK
* checking code files for non-ASCII characters ... OK
* checking R files for syntax errors ... OK
* checking whether the package can be loaded ... OK
* checking whether the package can be loaded with stated dependencies ... OK
* checking whether the package can be unloaded cleanly ... OK
* checking whether the namespace can be loaded with stated dependencies ... OK
* checking whether the namespace can be unloaded cleanly ... OK
* checking loading without being on the library search path ... OK
* checking use of S3 registration ... OK
* checking dependencies in R code ... OK
* checking S3 generic/method consistency ... OK
* checking replacement functions ... OK
* checking foreign function calls ... OK
* checking R code for possible problems ... [18s/18s] OK
* checking Rd files ... OK
* checking Rd metadata ... OK
* checking Rd line widths ... OK
* checking Rd cross-references ... OK
* checking for missing documentation entries ... OK
* checking for code/documentation mismatches ... OK
* checking Rd \usage sections ... OK
* checking Rd contents ... OK
* checking for unstated dependencies in examples ... OK
* checking contents of ‘data’ directory ... OK
* checking data for non-ASCII characters ... OK
* checking data for ASCII and uncompressed saves ... OK
* checking line endings in C/C++/Fortran sources/headers ... OK
* checking pragmas in C/C++ headers and code ... OK
* checking compilation flags used ... OK
* checking compiled code ... OK
* checking sizes of PDF files under ‘inst/doc’ ... OK
* checking installed files from ‘inst/doc’ ... OK
* checking files in ‘vignettes’ ... OK
* checking examples ... ERROR
Running examples in ‘tm-Ex.R’ failed
The error most likely occurred in:

> ### Name: TermDocumentMatrix
> ### Title: Term-Document Matrix
> ### Aliases: TermDocumentMatrix DocumentTermMatrix as.TermDocumentMatrix
> ###   as.DocumentTermMatrix
>
> ### ** Examples
>
> data("crude")
> tdm <- TermDocumentMatrix(crude,
+                           control = list(removePunctuation = TRUE,
+                                          stopwords = TRUE))
> dtm <- DocumentTermMatrix(crude,
+                           control = list(weighting =
+                                          function(x)
+                                          weightTfIdf(x, normalize =
+                                                      FALSE),
+                                          stopwords = TRUE))
> inspect(tdm[202:205, 1:5])
<>
Non-/sparse entries: 6/14
Sparsity           : 70%
Maximal term length: 9
Weighting          : term frequency (tf)
Sample             :
           Docs
Terms       127 144 191 194 211
  companies   1   1   0   0   0
  company     1   0   0   1   0
  companys    0   0   1   0   0
  compared    0   0   0   0   1
> inspect(tdm[c("price", "prices", "texas"), c("127", "144", "191", "194")])
<>
Non-/sparse entries: 8/4
Sparsity           : 33%
Maximal term length: 6
Weighting          : term frequency (tf)
Sample             :
        Docs
Terms    127 144 191 194
  price    2   1   2   2
  prices   3   5   0   0
  texas    1   0   0   2
> inspect(dtm[1:5, 273:276])
<>
Non-/sparse entries: 6/14
Sparsity           : 70%
Maximal term length: 9
Weighting          : term frequency - inverse document frequency (tf-idf)
Sample             :
     Terms
Docs  companies  company company's compared
  127  2.736966 2.321928  0.000000 0.000000
  144  2.736966 0.000000  0.000000 0.000000
  191  0.000000 0.000000  4.321928 0.000000
  194  0.000000 2.321928  0.000000 0.000000
  211  0.000000 0.000000  0.000000 2.736966
>
> s <- SimpleCorpus(VectorSource(unlist(lapply(crude, as.character))))
> m <- TermDocumentMatrix(s,
+                         control = list(removeNumbers = TRUE,
+                                        stopwords = TRUE,
+                                        stemming = TRUE))
Error in loadNamespace(x) : there is no package called ‘SnowballC’
Calls: TermDocumentMatrix ... loadNamespace -> withRestarts -> withOneRestart -> doWithOneRestart
Execution halted
* checking for unstated dependencies in ‘tests’ ... OK
* checking tests ...
  Running ‘testthat.R’
 OK
* checking for unstated dependencies in vignettes ... OK
* checking package vignettes ... OK
* checking re-building of vignette outputs ... ERROR
Error(s) in re-building vignettes:
--- re-building ‘extensions.Rnw’ using Sweave
Loading required package: NLP
--- finished re-building ‘extensions.Rnw’

--- re-building ‘tm.Rnw’ using Sweave
Loading required package: NLP
Error: processing vignette 'tm.Rnw' failed with diagnostics:
 chunk 12 (label = Stemming)
Error in loadNamespace(x) : there is no package called ‘SnowballC’

--- failed re-building ‘tm.Rnw’

SUMMARY: processing the following file failed:
  ‘tm.Rnw’

Error: Vignette re-building failed.
Execution halted

* checking PDF version of manual ... [11s/12s] OK
* checking HTML version of manual ... OK
* checking for non-standard things in the check directory ... OK
* checking for detritus in the temp directory ... OK
* DONE

Status: 2 ERRORs, 1 NOTE
See
  ‘/data/gannet/ripley/R/packages/tests-Suggests/tm.Rcheck/00check.log’
for details.

Command exited with non-zero status 1
Time 1:59.11, 128.33 + 14.64