Handy Links for Subversion:
Download Subversion Client
http://subversion.tigris.org/faq.html
This blog focuses on the relationships that connect us together providing potent insights for decision makers. In addition, a few data mining topics are presented.
Saturday, March 18, 2006
Friday, March 17, 2006
Tuesday, January 31, 2006
GEDCOM file format information
The following links discuss the how the GEDCOM format is defined:
Cyndi's List
The GEDCOM Standard Release 5.5
GEDCOM: The Next Generation
Cyndi's List
The GEDCOM Standard Release 5.5
GEDCOM: The Next Generation
Friday, January 20, 2006
Networked Data File Types
Here are reference links to common network data file types used in Link Mining and Social Network Analysis:
Pajek Net File
UCINet DL Files and VNA
Pajek Net File
UCINet DL Files and VNA
Friday, December 09, 2005
Machine Learning Topics
Particle Swarm Optimization
wikipedia
Swarm Intelligence
Ant Algorithms
ant colony optimization
Reinforcement Learning
wikipedia
Q-learning
Q-learning definition
Markov decision process
Computational Learning Theory
wikipedia
VC dimension
Principle of maximum entropy
Ensembles, Bagging and Boosting
Boosting
Meta-Learning
METAL KDD
Christophe Giraud-Carrier
HMMs
Hidden Markov model
wikipedia
Swarm Intelligence
Ant Algorithms
ant colony optimization
Reinforcement Learning
wikipedia
Q-learning
Q-learning definition
Markov decision process
Computational Learning Theory
wikipedia
VC dimension
Principle of maximum entropy
Ensembles, Bagging and Boosting
Boosting
Meta-Learning
METAL KDD
Christophe Giraud-Carrier
HMMs
Hidden Markov model
Saturday, October 29, 2005
Viral Marketing
Dr. Ralph F. Wilson suggests that Viral Marketing is comprised of the following components:
1. Gives away products or services
2. Provides for effortless transfer to others
3. Scales easily from small to very large
4. Exploits common motivations and behaviors
5. Utilizes existing communication networks
6. Takes advantage of others' resources
The effects of word-of-mouth, or viral marketing are motivations for utilizing the social network that customers belong in.
1. Gives away products or services
2. Provides for effortless transfer to others
3. Scales easily from small to very large
4. Exploits common motivations and behaviors
5. Utilizes existing communication networks
6. Takes advantage of others' resources
The effects of word-of-mouth, or viral marketing are motivations for utilizing the social network that customers belong in.
Friday, September 16, 2005
Customer Segmentation
Customer analysis helps a business better meet customer needs. Learning more about your customers is often benefited by intelligent segmentation. Customers can be segmented into a variety of groups. These segments can be based on behavioural, demographic, geographic, and psychographic variables. In fact customers can be segmented by any combination of these variables. Through viewing customers within such segments the problem of identifying and serving customers is simplifed. The knowledge provided by these segments is usually useful for determining actionable marketing tactics.
Tuesday, September 13, 2005
Stanford Data Mining Course
Stanford offers a nice Data Mining and Electronic Business course within the Statistics department. It looks like it covers many exciting aspects of the field.
Thursday, July 28, 2005
What is Lift?
In data mining, "lift" is often used to measure model performance. Here is a link to an article that explains how it is used: DMReview article
Wednesday, July 20, 2005
IP Country Lookup Tool

Here is a link to a tool that I created to lookup the countries for all of the IP addresses in a mess of text.
Thursday, June 30, 2005
Idea: Transaction Logger
It would be great to create a device that logs all of my transactions (whatever the method used to purchase) that could be carried around while shopping. This would then enable consumers to do personal data-mining on all of their own transactions. This would be a unique tool to help consumers improve purchasing habits and make smarter decisions. Additional interesting product associations could also be calculated and analyzed.
Friday, June 17, 2005
Web Data Mining (for Business Intelligence)
Bamshad Mobasher teaches a nice Web Mining course entitled "Web Data Mining (for Business Intelligence)" at DePaul University in Illinois. Currently, it is one of the few courses dedicated solely to this topic. I expect, as time goes, the number of courses on this topic will grow dramatically.
Tuesday, June 14, 2005
Exploring Bayesian Methods
Bayesian methods can be used to deal with uncertainty.
Here are some links that help to explore the area:
Bayesian Inference
Empirical Bayes
Hierarchal Bayes
Bayesian Network
Bayes' theorem
Statistics Topics
Expected Value
Likelihood
Mean
Variance
Mean Squared Error (MSE)
Posterior Probability
Conditional, Joint, and Marginal Probability
Utility Functions (Link 2)
Distributions
Normal
Gamma
Poisson
Beta
Binomial
Conjugate Prior
Other Related Topics
Markov Chain Monte Carlo (MCMC)
Simulated Annealing
Tabu Search (Link 2)
Kalman Filter (Link 2)
Particle Filter
Directed Acyclic Graphs (DAG)
Markovian Random Field (MRF)
EM Algorithm (Bayesian Structural EM - Friedman)
Reading List of Bayesian Methods
Helpful Software
JavaBayes
Graphviz
Useful Java Libraries
Colt
Tomato
Thursday, June 02, 2005
Web Content Mining: Bing Liu
Bing Liu from the University of Chicago is very interested in Web Content Mining. He compiled of list of references regarding the topic. In addition, he gave a tutorial on Web content mining in Chiba, Japan in May, 2005.
Thursday, April 21, 2005
Natural Language Processing
Here is a nice introduction and dictionary for Natural Langauge Processing (NLP). This reference might come in handy when mining text documents.
Subscribe to:
Comments (Atom)