Is Python call-by-value or call-by-reference? Neither.

There is one tricky but very popular problem I often encounter. How do you deal with value assignment in Python, for variable, list and tuple? In fact, this question has been answered by many people and I want to brief my thinking and list the reference. In Python (and other languages), a variable (and object … Continue reading Is Python call-by-value or call-by-reference? Neither.

How to Tune Algorithm Parameters with Scikit-Learn

by Jason Brownlee on July 16, 2014 in Python Machine Learning Machine learning models are parameterized so that their behavior can be tuned for a given problem. Models can have many parameters and finding the best combination of parameters can be treated as a search problem. In this post you will discover how to tune … Continue reading How to Tune Algorithm Parameters with Scikit-Learn

SQL syntax that you should pay attention to

The article will mainly talk about some pitfalls or important command of general SQL, just mark them and for future reference. Most of the content comes from SQL tutorial .   1. SQL Injection SQL injection is a technique where malicious users can inject SQL commands into an SQL statement, via web page input. Injected SQL … Continue reading SQL syntax that you should pay attention to

Difference between process and thread

Though, multiprocess and multithread are all accessible ways to do parallel computing and big data problems, there are some differences between these two terms and I will list some reference and thoughts about this topic. Each process provides the resources needed to execute a program. A process has a virtual address space, executable code, open handles to … Continue reading Difference between process and thread

Fast way to do multi url parsing

Since opening URL sequentially, especially hundreds of URL is very slow, it is a perfect case to implement parallel computing. There are two biggest components that determine the speed of this task: “opening URL” and “read the context from the website”. So, I will briefly talk about the fast way to do the multi-URL parsing. First, we … Continue reading Fast way to do multi url parsing

My daily working log

I will list some packages or knowledge during my internship, just for reference and casual discussion. Python Time Scheduler Scheduler(https://pypi.python.org/pypi/schedule) Perfectly solving the problem of auto email sending Auto Email Sending Google Cloud (https://cloud.google.com/) DAG Structure Airflow(http://nerds.airbnb.com/airflow/) Word Comparision FuzzyWuzzy (https://github.com/seatgeek/fuzzywuzzy) Difflib (https://docs.python.org/2/library/difflib.html) Difflib provides three types of comparison method, regarding the speed. Graphical Viewer snakeviz … Continue reading My daily working log

A Complete Tutorial on Ridge and Lasso Regression in Python

AARSHAY JAIN , JANUARY 28, 2016 / 39 Introduction When we talk about Regression, we often end up discussing Linear and Logistics Regression. But, that’s not the end. Do you know there are 7 types of Regressions ? Linear and logistic regression is just the most loved members from the family of regressions.  Last week, I saw a recorded … Continue reading A Complete Tutorial on Ridge and Lasso Regression in Python