1 | Core lexical query | TS = (“Big Data” or Bigdata or “Map Reduce” or MapReduce or Hadoop or Hbase or Nosql or Newsql) | 8,602 |
2 | Expanded lexical query | TS = ((Big Near/1 Data or Huge Near/1 Data) or “Massive Data” or “Data Lake” or “Massive Information” or “Huge Information” or “Big Information” or “Large-scale Data” or Petabyte or Exabyte or Zettabyte or “Semi-Structured Data” or “Semistructured Data” or “Unstructured Data”) | 11,798 |
| | TS = (“Cloud Comput*” or “Data Min*” or “Analytic*” or “Privacy” or “Data Manag*” or “Social Media*” or “Machine Learning” or “Social Network*” or “Security” or “Twitter*” or “Predict*” or “Stream*” or “Architect*” or “Distributed Comput*” or “Business Intelligence” or “GPU” or “Innovat*” or “GIS” or “Real-Time” or “Sensor Network*” or “Smart Grid*” or “Complex Network*” or “Genomics” or “Parallel Comput*” or “Support Vector Machine” or “SVM” or “Distributed” or “Scalab*” or “Time Serie*” or “Data Science” or “Informatics*” or “OLAP”) | 3,113,113 (part A AND part B = 7,696) |
3 | #1 OR (#2 AND #3); 2006–2016 | SCI = 4,673; SSCI = 1,026, of which 541 are not also in SCI –download 541; AHCI (not in SSCI) = 45 down; CPCI-S & CPCI-SSH = 6,267 (of which 6,093 not in SCI-SSCI – download) – hit 5,000 limit. so split – download 6,093; BCI-S & BCI-SSH = 376 – download all (ignore possible overlaps) | |
| | ESCI – search #1 = 0; so leave that dB out; ** save the separate downloaded into VP files in case we want to analyze sometime – note trend behavior for 2015 differs greatly from SCI/SSCI (UP) to CPCI’s (DOWN). I think due largely to incomplete indexing at this date in WoS. Also saved the combo – 11,728 total – removed dups to get 11,684 (saved with the component files on the flash memory). | |