DistributedDataMining/en

From BC-Wiki
Jump to navigation Jump to search
Ddm-logo.png

Time Series Analysis

Stock Price Prediction (active) Part of our research is devoted to Time Series Analysis. Our focus is on forecasting economic time series such as DAX and Dow Jones. At first, we focused on the application of artificial neural networks to forecast time series. A detailed description on this approach, the design of the experimental setting as well as the results are presented in [5]. Later on, we applied support vector machines to avoid the high computational complexity of neural networks. The resulting forecasts are equally impressive even though the necessary computational costs can be decreased significantly. In 2008, we published two related studies [6] and [7]. We extended our studies by using various learning algorithms in order to determine there applicability for stock price prediction. After analyzing the obtained results we made two important observations: (i) the influence of the learning algorithm is much lower than expected, but instead (ii) the training window size has a stronger impact on the quality of the prediction. Since, so far, temporal effects are rarely addressed in the literature, we concentrate in our dDM-project on the study of these temporal aspects in time series analysis.


Social Network Analysis Tanja Falkowski proposed DenGraph - a density-based graph clustering algorithm [1]. This algorithm is deployable for - among other things - Social Network Analysis. The following studies were part of her PhD theses that is published as a book.

Temporal Dynamics of the Last.fm Music Platform (temporarily suspended) In this application we applied DenGraph-IO to detect and observe changes in the music listening behaviour of Last.fm users during a period of two years. The aim was to see, whether the proposed clustering technique detects meaningful communities and evolutions [2], [3].

Temporal Evolution of Communities in the Enron Email Data Set (finished) The collapse of Enron, a U.S. company honored in six consecutive years by "Fortune" as "America's Most Innovative Company", caused one of the biggest bankruptcy cases in US-history. To investigate the case, a data set of approximately 1.5 million e-mails sent or received by Enron employees was published by the Federal Energy Regulatory Commission. We've used the processing power of dDM to analyze the temporal evolution of communities extracted from these email correspondences [4].

Medical Data Analysis For the clinical diagnosis of pathological conditions of the human body a variety of sophisticated examination techniques are employed. In usual clinical time frames the amount of time available for analysing and interpreting the acquired data is limited. As a result, diagnostic failure may occur, which can have serious consequences for the affected patient. Medical Data Analysis and computer-aided diagnosis systems can be provided to the physician, facilitating clinical his decisions and yielding more reliable identification of pathological alterations.

Laryngeal high-speed video classification (on-going) The automatic identification of voice disorders is one particular field of interest of Daniel Voigt's work. Audio recordings of the acoustical voice signal are analysed with specialized software quantifying the amount of perturbation (noise) in the signal. Through automated feature extraction from the recordings and subsequent machine learning analysis, laryngeal movement patterns can be quantitatively captured and automatically classified according to different diagnostic classes.

Erde.png DistributedDataMining
Start 2010
End April 2018
Status finished
Admin Nico Schlitter
Institution -
Country Germany
Area Grid/Informatic
Info.png Apps
Win.png Win Biological Data Analysis: Laryngeal Video Classification 1.36
Biological data Analysis: Multi-Agent Simulation of Evolution 1.05
Linux2.png Linux Time Series Analysis: Stock Price Prediction 16.00
Biological Data Analysis: Laryngeal Video Classification 1.36
Macos.gif Mac
Amd64.jpg 64bit Time Series Analysis: Stock Price Prediction 16.00 [linux]
Biological Data Analysis: Laryngeal Video Classification 1.36 [linux/win]
Biological data Analysis: Multi-Agent Simulation of Evolution 1.05 [win]
Ps3.png PS3
Ati.jpg ATI
Cuda.jpg CUDA
Intel.jpg Intel
Android.png Android No.gif
Raspberri Pi.png RPi No.gif
Nci.jpg NCI No.gif
Specs.png System-Specs
VRAM SP No.gif DP No.gif
RAM 200MB
254MB (MDA)
Runtime sek-30h (SPP)
8min (MDA)
HDD 22MB
Traffic dl/ul kb/kb
Deadline 5 days
21 days (MDA)
Checkpoints Yes.gif