跳至主要内容

博文

目前显示的是 八月, 2018的博文

using org2jekyll for blog publishing

switch to org2jekyll package from homegrown solution For some time, I was using the `autoinsert’ function to prepare Jekyll post. I have the following auto-insert define in my .emacs, so whenever I create a new markdown file in Jekyll ` post ` directory it prompts me for the Jekyll header. (define-auto-insert '("\\.markdown" . "Jekyll Markdown Post") '("TITLE: " "---\nlayout: post\ntitle: " str "\ndate: " (format-time-string "%Y-%m-%d %H:%m:%S %z") "\ncategories: " ("CATEGORY: " str " ") -1 "\n---\n" _ "\n") t) It turns out the procedure is too error prone. I got into various error (filename convention, markdown syntax, etc.) many times when trying to publish my post. Today I have the org2jekyll setted up, all steps need to publish a jekyll post is now simplified to: M-x org2jekyll-create-draft Anwser the questions as usual for Jekyl...

Enabling Native Acceleration for MLlib

The undefined symbol issue Got onto the ship of machine learning. And soon I hit the wall of `undefined symbol issue’ on lab cluster. Hi, it just the simple `MinMaxScaler’ example code published on spark MLlib(Machine Learning) guide!! Everything goes well until the last line `scaledData.show()`, boom. Spark-shell died with the following message on the console: /usr/java/jdk1.7.0_67-cloudera/bin/java: symbol lookup error: /tmp/jniloader82069440205403545netlib-native_system-linux-x86_64.so: undefined symbol: cblas_daxpy NO log. NO history server record. Nothing could be used for debug as first glance. Solution (Wrap up) My solution is based on CDH 5.11.0 (parcel) plus cloudera GPLExtra (parcel) plus Intel MKL library (parcel). The steps to enable MKL native acceleration for cloudera spark should be as simple as: Install `netlib-java` by integrate GPLExtra parcel as described in the Enable Native Accerleration For MLlib Install MKL library parcel by follow the Download Intel Ma...