Intensive Social Media Weeks

Introduction:

1. Computational Social Science
http://www.barabasilab.com/pubs/CCNR-ALB_Publications/200902-06_Science-CompSocial/200902-06_Science-CompSocial.pdf
2. Users of the world, unite! The challenges and opportunities of Social Media
http://michaelhaenlein.com/Publications/Kaplan,%20Andreas%20-%20Users%20of%20the%20world,%20unite.pdf
 
 
Introduction to Information Retrieval and natural language processing:
http://nlp.stanford.edu/IR-book/html/htmledition/irbook.html
 
Community Question Answering:
/CQA_references_glossary.pdf
 
NodeXL:
Instruction: http://users.jyu.fi/~alsemeno/Demo%201%20(NodeXL).pdf
Official website http://nodexl.codeplex.com/
 
 
MLlib | Apache Spark:
Spark Programming Guide https://spark.apache.org/docs/1.3.1/programming-guide.html
Spark Summit 2014 Training Materials https://spark-summit.org/2014/training
M. Zaharia et al. Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing https://www.usenix.org/system/files/conference/nsdi12/nsdi12-final138.pdf
H. Karau et al. Learning Spark: Lightning-Fast Big Data Analysis http://shop.oreilly.com/product/0636920028512.do
 
 
Semantic Web - technology of knowledge representation in the web:
http://www.ianus-fdz.de/attachments/95/2007-07-Berners-Lee_Semantic_Web.pdf
http://tomheath.com/papers/bizer-heath-berners-lee-ijswis-linked-data.pdf
http://linkeddatabook.com/editions/1.0/
http://www.ted.com/talks/tim_berners_lee_on_the_next_web - TED Next Web Video
http://www.w3.org/standards/semanticweb/- SemWeb Standarts W3C
http://linkeddata.org/ - Linked Data
 
Main terms:
Graph/Network, Edge/arc, Vertex/node, Degree, Centrality, Cluster, Community detection, Population, Sample, Sample bias, Representative sample, Data noise, Lemmatization/stemming, Bag of words, Vector model, Similarity measure (distance), Fuzzy clustering, Quality function optimization, API
Semantic Web (technology of knowledge representation in the web), RDF (standart of resource description), Ontologies and Vocabularies, OWL (language for ontology description), Triplestores Reasoning, Linked Data (initiative to link existing datasets), LOD Cloud and Knowledge Graphs, SPARQL queries (queries to the triplestore), Federated queries (queries to the linked data cloud), Data alignment (linking of entities from different datasets)