Machine and Deep Learning with Python
Education
Tutorials and courses
- - Course for Python programming for the Humanities, assuming no prior knowledge. Heavy focus on text processing / NLP.
Pyplot
Material Databases
- - Natural language Understanding Toolkit
- and - Multi-algorithm ensemble learning packages.
- - Bolt Online Learning Toolbox
- - The Shogun Machine Learning Toolbox
Algorithms
- - Boruta: A wrapper algorithm for all-relevant feature selection
Cheatsheets
Theory and Use Cases
Astronomy
- - Machine Learning and Data Mining for Astronomy.
Law
Fraud Detection
Chat
Business and money
Bullying
Gaming
Recommendations
Text Analysis
Natural Language Processing
- - BLLIP Natural Language Parser (also known as the Charniak-Johnson parser)
- - Providing a consistent API for diving into common natural language processing (NLP) tasks. Stands on the giant shoulders of NLTK and Pattern, and plays nicely with both.
Sport
Image Recognition
Kaggle Competition
- - Industrial strength NLP with Python and Cython.
- - Python interface for converting Penn Treebank trees to Stanford Dependencies.
- - An implementation of Dell Zhang's solution to Wikipedia's Participation Challenge on Kaggle
- - Kaggle Submission for "Detecting Insults in Social Commentary"
- - Code for the Kaggle acquire valued shoppers challenge
- - Code for the CIFAR-10 competition at Kaggle, uses cuda-convnet
- - Deep learning made easy
- - Code for Accelerometer Biometric Competition at Kaggle
- - Predicting job salaries from ads - a Kaggle competition
- - Amazon access control challenge
- - Code for the Best Buy competition at Kaggle
- - Code for Kaggle Dogs vs. Cats competition
- - Winning solution for the Galaxy Challenge on Kaggle
- - A Kaggle competition: discriminate gender based on handwriting
- - Merck challenge at Kaggle
- - Predicting closed questions on Stack Overflow
- - Code for the Kaggle acquire valued shoppers challenge
- - Predicting wine quality
General-Purpose Machine Learning
- - Topic Modelling for Humans.
- - Restricted Boltzmann Machines in Python.
- - Python implementation of cover trees, near-drop-in replacement for scipy.spatial.kdtree
- - Machine learning for NeuroImaging in Python
- - A wrapper around scikit-learn that makes it simpler to conduct experiments.
- - Python Environment for Bayesian Learning
- - Hidden Markov Models for Python, implemented in Cython for speed and efficiency.
- - Deep Learning In Python
Data Analysis / Data Visualization
- Pandas on PySpark (POPS)
- - ahaz: Regularization for semiparametric additive hazards regression
- - arules: Mining Association Rules and Frequent Itemsets
- - bigrf: Big Random Forests: Classification and Regression Forests for Large Data Sets
- - bst: Gradient Boosting
- - C50: C5.0 Decision Trees and Rule-Based Models
- - CORElearn: Classification, regression, feature evaluation and ordinal evaluation
- - CoxBoost: Cox models by likelihood based boosting for a single survival endpoint or competing risks
- - Cubist: Rule- and Instance-Based Regression Modeling
- - e1071: Misc Functions of the Department of Statistics (e1071), TU Wien
- - earth: Multivariate Adaptive Regression Spline Models
- - elasticnet: Elastic-Net for Sparse Estimation and Sparse PCA
- - ElemStatLearn: Data sets, functions and examples from the book: "The Elements of Statistical Learning, Data Mining, Inference, and Prediction" by Trevor Hastie, Robert Tibshirani and Jerome Friedman
- - evtree: Evolutionary Learning of Globally Optimal Trees
- - fpc: Flexible procedures for clustering
- - frbs: Fuzzy Rule-based Systems for Classification and Regression Tasks
- - GAMBoost: Generalized linear and additive models by likelihood based boosting
- - gamboostLSS: Boosting Methods for GAMLSS
- - gbm: Generalized Boosted Regression Models
- - glmnet: Lasso and elastic-net regularized generalized linear models
- - glmpath: L1 Regularization Path for Generalized Linear Models and Cox Proportional Hazards Model
- - GMMBoost: Likelihood-based Boosting for Generalized mixed models
- - grplasso: Fitting user specified models with Group Lasso penalty
- - grpreg: Regularization paths for regression models with grouped covariates
- - hda: Heteroscedastic Discriminant Analysis
- - ipred: Improved Predictors
- - kernlab: Kernel-based Machine Learning Lab
- - klaR: Classification and visualization
- - lars: Least Angle Regression, Lasso and Forward Stagewise
- - lasso2: L1 constrained estimation aka ‘lasso’
- - LogicReg: Logic Regression
- - maptree: Mapping, pruning, and graphing tree models
- - mboost: Model-Based Boosting
- - medley: Blending regression models, using a greedy stepwise approach
- - mvpart: Multivariate partitioning
- - ncvreg: Regularization paths for SCAD- and MCP-penalized regression models
- - nnet: Feed-forward Neural Networks and Multinomial Log-Linear Models
- - oblique.tree: Oblique Trees for Classification Data
- - pamr: Pam: prediction analysis for microarrays
- - party: A Laboratory for Recursive Partytioning
- - partykit: A Toolkit for Recursive Partytioning
- - penalized: L1 (lasso and fused lasso) and L2 (ridge) penalized estimation in GLMs and in the Cox model
- - penalizedLDA: Penalized classification using Fisher's linear discriminant
- - penalizedSVM: Feature Selection SVM using penalty functions
- - quantregForest: Quantile Regression Forests
- - randomForest: Breiman and Cutler's random forests for classification and regression
- - randomForestSRC: Random Forests for Survival, Regression and Classification (RF-SRC)
- - rattle: Graphical user interface for data mining in R
- - rda: Shrunken Centroids Regularized Discriminant Analysis
- - rdetools: Relevant Dimension Estimation (RDE) in Feature Spaces
- - REEMtree: Regression Trees with Random Effects for Longitudinal (Panel) Data
- - relaxo: Relaxed Lasso
- - rgenoud: R version of GENetic Optimization Using Derivatives
- - Rmalschains: Continuous Optimization using Memetic Algorithms with Local Search Chains (MA-LS-Chains) in R
- - rminer: Simpler use of data mining methods (e.g. NN and SVM) in classification and regression
- - ROCR: Visualizing the performance of scoring classifiers
- - RoughSets: Data Analysis Using Rough Set and Fuzzy Rough Set Theories
- - rpart: Recursive Partitioning and Regression Trees
- - RPMM: Recursively Partitioned Mixture Model
- - RSNNS: Neural Networks in R using the Stuttgart Neural Network Simulator (SNNS)
- - RWeka: R/Weka interface
- - RXshrink: Maximum Likelihood Shrinkage via Generalized Ridge or Least Angle Regression
- - sda: Shrinkage Discriminant Analysis and CAT Score Variable Selection
- - SDDA: Stepwise Diagonal Discriminant Analysis
- - svmpath: The SVM Path algorithm
- - tgp: Bayesian treed Gaussian process models
- - tree: Classification and regression trees
- - varSelRF: Variable selection using random forests
Video Streaming
Time
Audio
Python & Machine Learning
- Generic and unclassified
Misc Scripts / iPython Notebooks / Codebases
- - Biologically-Inspired and Machine Learning Algorithms in Python.
- - Topic Modeling the Sarah Palin emails.
- - A collection of image segmentation algorithms based on diffusion methods
- - SciPy tutorials. This is outdated, check out scipy-lecture-notes
- - A recommendation engine library for Python
- - Bayesian Inference Tools in Python
- - Series of notebooks for learning scikit-learn
- - Tweets Sentiment Analyzer
- - Sentiment classifier using word sense disambiguation.
- - Some experiments with the coordinate descent algorithm used in the (Sparse) Group Lasso model
- - Kanji / Hiragana / Katakana to Romaji Converter. Edict Dictionary & parallel sentences Search. Sentence Similarity between two JP Sentences. Sentiment Analysis of Japanese Text. Run CaboCha (ISO-8859-1 configured) in Python.
- - IPython notebooks for EEG/MEG data processing using mne-python
- - Recipes for using Python's pandas library
- - Book/iPython notebooks on Probabilistic Programming in Python
Tools
Deep Learning Frameworks
- - Evolutionary algorithm framework.
- - Genetic algorithm framework.
- - A deep learning framework developed with cleanliness, readability, and speed in mind.
- - A suite of ML tools designed to be easy to embed in other applications.
- - A fast out-of-core learning system.
- - Suite of fast incremental algorithms.
- - The Shogun Machine Learning Toolbox
- - A deep learning framework developed with cleanliness, readability, and speed in mind. [DEEP LEARNING]
- - Yet another deep learning framework with less than 1000 lines core code [DEEP LEARNING]
- - A parallelized optimized general purpose gradient boosting library.
- - A probabilistic programming language implementing full Bayesian statistical inference with Hamiltonian Monte Carlo sampling
- - A simple Multi-armed Bandit library.
Libraries
Environment Management
- - Dead Simple Interactive Python Version Management.
- - Simple Python version management.
- - A tool to create isolated Python environments.
- - A set of extensions to virtualenv.
- - An API for virtualenv and pip.
- - A set of tools to manage multiple virtual environments.
- - Run a command in the named virtualenv.
- - A one-file, no-installation-needed version of Python.
Package Management
- - The Python package and dependency manager.
- - Cross-platform, Python-agnostic binary package manager.
- - Curdling is a command line tool for managing Python packages.
- - The new standard for Python distributions, intended to replace eggs.
Package Repositories
- - Next generation Python Package Repository (PyPI).
- - PyPI server and packaging/testing/release tool.
- - PyPI server which mirrors official packages on-demand, and also supports local (private) package uploads.
- - PyPI mirroring tool provided by Python Packaging Authority (PyPA)
Distribution
- - Freezes Python scripts (cross-platform).
- - Freezes Python scripts (Windows).
- - A tool to build Windows installers, installers bundle Python itself.
- - Freezes Python scripts (Mac OS X).
- - Converts Python programs into stand-alone executables (cross-platform).
- - Build and distribute a virtualenv as a Debian package.
- - Compile scripts, modules, packages to an executable or extension module.
Build Tools
- - A build system for creating, assembling and deploying applications from multiple parts, some of which may be non-Python-based.
- - A software construction tool.
- - A console tool to build code with different development platforms.
- - A make-like build tool with the special focus of distributions and packages for embedded Linux.
- - A build tool that finds dependencies automatically for any language.
Interactive Interpreter
- - A rich toolkit to help you make the most out of using Python interactively.
- - A fancy interface to the Python interpreter.
- - Advanced Python REPL built on top of the .
Files
- - (Python standard library) Map filenames to MIME types.
- - (Python standard library) Determine the type of an image.
- - A Python interface to the libmagic file type identification library.
- - A module wrapper for .
- - API and shell utilities to monitor file system events.
- - An object-oriented approach to file/directory operations.
- - (Python standard library in Python 3.4+) A cross-platform, object-oriented path library.
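
The two standard-library entries in this section can be tried without installing anything. The sketch below assumes they refer to the `mimetypes` and `pathlib` modules (the list does not show the names); the helper `describe` is a hypothetical name made up for this illustration.

```python
import mimetypes
from pathlib import PurePosixPath


def describe(filename: str) -> dict:
    """Split a filename into parts and guess its MIME type from the extension.

    Nothing is read from disk: both modules work on the name alone.
    """
    path = PurePosixPath(filename)
    # guess_type returns a (type, encoding) pair; either may be None.
    mime_type, encoding = mimetypes.guess_type(path.name)
    return {
        "stem": path.stem,      # filename without its last suffix
        "suffix": path.suffix,  # last suffix, including the dot
        "mime_type": mime_type,
        "encoding": encoding,
    }


if __name__ == "__main__":
    print(describe("docs/notes.txt"))
    print(describe("backups/site.tar.gz"))
```

`PurePosixPath` is used here so the example behaves the same on every platform; for real file system access, `pathlib.Path` would be the usual choice.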
Date and Time
- - Better dates & times for Python.
- - A Python 3 library for parsing human-written times and dates.
- - Extensions to the standard Python module.
- - A library for clearing up the inconvenient truths that arise dealing with datetimes.
- - Providing user-friendly functions to help perform common date and time actions.
- - A Python library for dealing with dates/times. Inspired by .
- - World timezone definitions, modern and historical. Brings the into Python.
- - An easy-to-use Python module for manipulating date/time/datetime values through strings.
Text Processing
- - Python bindings for eXtreme Gradient Boosting (Tree) Library
- - (Python standard library) Helpers for computing deltas.
- - Fast computation of Levenshtein distance and string similarity.
- - Fuzzy String Matching.
- - Regular expression accelerator.
- - A generator library for concise, unambiguous and URL-safe UUIDs.
- - Makes Unicode text less broken and more consistent automagically.
- - ASCII transliterations of Unicode text.
- - Python 2/3 compatible character encoding detector.
- - A library to translate Chinese hanzi (漢字) to pinyin (拼音).
- - Spacing texts for CJK and alphanumerics.
- - An implementation of figlet written in Python.
- - Print readable chars instead of the escaped string.
- Slugify
- - A Python slugify library that can preserve unicode.
- - A Python slugify library that translates unicode to ASCII.
- - Implementation of lex and yacc parsing tools for Python
- - Parsing, formatting, storing and validating international phone numbers.
- - Browser user agent parser.
- - A non-validating SQL parser.
- - A generic syntax highlighter.
- - Parsing human names into their individual components.
- - A general purpose framework for generating parsers.
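
Several entries above deal with string similarity (deltas, Levenshtein distance, fuzzy matching). As a rough illustration of what those libraries compute, here is a minimal pure-Python sketch. It assumes the standard-library entry refers to `difflib`; the `levenshtein` and `similarity` functions are hypothetical helpers written for this example, not the API of any listed package, and the dedicated libraries are much faster.

```python
import difflib


def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum number of insertions, deletions and substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Similarity ratio in [0, 1] as computed by difflib.SequenceMatcher."""
    return difflib.SequenceMatcher(None, a, b).ratio()


if __name__ == "__main__":
    print(levenshtein("kitten", "sitting"))
    print(round(similarity("kitten", "sitting"), 3))
```

The distance function keeps only two rows of the dynamic-programming table, so memory use grows with the shorter string rather than with the product of both lengths.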
Specific Formats Processing
Libraries for parsing and manipulating specific text formats.
- General
- - A module for Tabular Datasets in XLS, CSV, JSON, YAML.
- Office
- - Reads, queries and modifies Microsoft Word 2007/2008 docx files.
- / - Writing and reading data and formatting information from Excel files.
- - A Python module for creating Excel .xlsx files.
- - A BSD-licensed library that makes it easy to call Python from Excel and vice versa.
- - A library for reading and writing Excel 2010 xlsx/xlsm/xltx/xltm files.
- - Takes Python data structures and turns them into spreadsheets.
- - Convert between any document format supported by LibreOffice/OpenOffice.
- PDF
- - A tool for extracting information from PDF documents.
- - A library capable of splitting, merging and transforming PDF pages.
- - Allows rapid creation of rich PDF documents.
- Markdown
- - A Python implementation of John Gruber’s Markdown.
- - The fastest, full-featured pure Python Markdown parser.
- YAML
- - YAML implementations for Python.
- CSV
- - Utilities for converting to and working with CSV.
- Archive
- - A command line tool that can unpack archives easily.
Natural Language Processing
- - Open source implementation of Conditional Random Fields (CRFs) for segmenting/labeling sequential data & other Natural Language Processing tasks.
- - Memory-based NLP suite developed for Dutch: PoS tagger, lemmatiser, dependency parser, NER, shallow parser, morphological analyzer.
- - A leading platform for building Python programs to work with human language data.
- - A web mining module for the Python programming language. It has tools for natural language processing, machine learning, among others.
- - A python framework to transform natural language questions to queries in a database query language
- - A sentence aligner, a friendly tool for extracting parallel sentences from comparable corpora.
- - Chinese Words Segmentation Utilities.
- - A library for processing Chinese text.
- - Another Chinese segmentation library.
- - A Chinese word segmenter based on Conditional Random Fields.
- - Text processing tools and wrappers (e.g. Vowpal Wabbit)
- - Python bindings for the BLLIP Natural Language Parser (also known as the Charniak-Johnson parser)
- - Python Natural Language Processing Library. General purpose NLP library for Python. Also contains some specific modules for parsing common NLP formats, most notably for , but also ARPA language models, Moses phrasetables, GIZA++ alignments.
- - Python binding to ucto (a unicode-aware rule-based tokenizer for various languages)
- - Python binding to Frog, an NLP suite for Dutch. (pos tagging, lemmatisation, dependency parsing, NER)
- - Python binding to a C++ library for extracting and working with basic linguistic constructions such as n-grams and skipgrams in a quick and memory-efficient way.
- - A web mining module for Python. It has tools for natural language processing, machine learning, among others.
- - Providing a consistent API for diving into common NLP tasks. Stands on the giant shoulders of NLTK and Pattern.
- - Stand-alone language identification system.
Documentation
- - Python Documentation generator.
- - Markup Syntax and Parser Component of Docutils.
- - Markdown friendly documentation generator.
- - The original quick-and-dirty, hundred-line-long, literate-programming-style documentation generator.
- - Epydoc replacement to auto generate API documentation for Python libraries.
Configuration
- - (Python standard library) INI file parser.
- - INI file parser with validation.
- - Hierarchical config from the author of .
- - Config from multiple formats with value conversion.
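
For a sense of what INI parsing with the standard library looks like, the sketch below assumes the first entry in this section refers to `configparser`. The section name, keys and the `load_settings` helper are hypothetical and exist only for this example.

```python
import configparser

# Hypothetical configuration text; in practice this would live in a .ini file.
SAMPLE = """
[server]
host = localhost
port = 8080
debug = yes
"""


def load_settings(text: str) -> dict:
    """Parse INI text and return the [server] section with typed values."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    section = parser["server"]
    return {
        "host": section.get("host"),
        "port": section.getint("port"),        # converts "8080" to int
        "debug": section.getboolean("debug"),  # accepts yes/no, on/off, true/false, 1/0
    }


if __name__ == "__main__":
    print(load_settings(SAMPLE))
```

`configparser` stores every value as a string and converts only on request, which is one reason the validating parsers listed above exist.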
Command-line Tools
- Command-line Application Development
- - Cement provides a light-weight and fully featured foundation to build anything from single file scripts to complex and intricately designed applications.
- - A package for creating beautiful command line interfaces in a composable way.
- - Python Command-line Application Tools.
- - A framework for creating command-line programs with multi-level commands.
- - Clime lets you convert any module into a multi-command CLI program without any configuration.
- - Pythonic command line arguments parser.
- - Cross-platform colored terminal text.
- - Command-line applications supporting standard command line parsing, logging, unit and functional testing.
- - Turn command line programs into a full GUI application with one line
- - A Library for building powerful interactive command lines.
- Productivity Tools
- - A command-line utility that creates projects from cookiecutters (project templates). E.g. Python package projects, jQuery plugin projects.
- - A command line HTTP client, a user-friendly cURL replacement.
- - Adds flavor of interactive selection to the traditional pipe concept on UNIX.
- - Smart and nice Twitter client on terminal.
- - Determine what projects are blocking you from porting to Python 3.
- - Correcting your previous console command.
- - A tool for live presentations in the terminal.
- - Select files out of bash output.
- - Making basic plots in the terminal. It's a quick way to visualize data without GUI.
Downloader
- - A command line tool for managing Amazon S3 and CloudFront.
- - Super S3 command line tool, good for higher performance.
- - A small command-line program to download videos from YouTube.
- - A YouTube/Youku/Niconico video downloader written in Python 3.
- - Script for downloading Coursera.org videos and naming them.
- - Tools for downloading and preserving wikis.
- - Library and command line tool to search and download subtitles.
Imagery
- - Pillow is the friendly fork.
- - Python bindings for , C API for ImageMagick.
- - A smart imaging service. It enables on-demand crop, re-sizing and flipping of images.
- - A project for searching a collection of images using visual similarity.
- - A pure Python QR Code generator.
- - Create barcodes in Python without needing PIL.
- - Instagram-like image filters.
- - Computer art based on quadtrees.
- - Nudity detection.
- - A Python library for (scientific) image processing.
- - Image histogram remapping.
OCR
- - A wrapper class for .
- - Another wrapper for Google Tesseract OCR.
- - A wrapper for Tesseract and Cuneiform.
Audio
- - Expressive Digital Signal Processing (DSP) package for Python.
- - Cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding.
- - A music library manager and tagger.
- - Audio fingerprinting and recognition.
- - A tool for working with audio files, specifically MP3 files containing ID3 metadata.
- - A Python module for reading MP3 meta data.
- - A Python module to handle audio metadata.
- - Manipulate audio with a simple and easy high level interface.
- - Python client for the API.
- - A Python library for speech/signal processing.
- - Open web audio processing framework.
- - A library for reading music meta data of MP3, OGG, FLAC and Wave files.
- - A module for parsing m3u8 file.
- - A data visualization package based on the grammar of graphics.
Video
- - A module for script-based movie editing with many formats, including animated GIFs.
- - Video summarization.
- - Video processing routines for SciPy.
Geolocation
- - Python Geocoding Toolbox.
- - Pure Python GeoIP API.
- - Python API for MaxMind GeoIP Legacy Database.
- - Python bindings and utilities for GeoJSON.
HTTP
- - HTTP Requests for Humans™.
- - requests + gevent for asynchronous HTTP requests.
- - An HTTP library with thread-safe connection pooling, file post support, sanity friendly.
- - Comprehensive HTTP client library.
- - Python requests like API built on top of Twisted's HTTP client.
Database
- - A native object database for Python. A key-value and object graph database.
- - A simple and lightweight key-value store for Python.
- - A tiny, document-oriented database.
Database Drivers
- Relational Databases
- - The MySQL database connector for Python.
- - mysql-python fork supporting Python 3.
- - Pure Python MySQL driver compatible to mysql-python.
- - A pure Python MySQL driver from Oracle.
- - A better MySQL connector with support for native prepared statements and BLOBs.
- - The most popular PostgreSQL adapter for Python.
- - Twisted based asynchronous driver for PostgreSQL.
- - A wrapper of the psycopg2 library for interacting with PostgreSQL.
- - Store Python dicts in a database - works with SQLite, MySQL, and PostgreSQL.
- - Another Python SQLite wrapper.
- NoSQL Databases
- - Python driver for Cassandra.
- - Python Thrift driver for Cassandra.
- - A developer-friendly library for Apache HBase.
- - The official Python client for MongoDB.
- - A fast and feature-rich Python interface to LevelDB.
- - The Redis Python Client.
- - Python wrapper client for Neo4j's restful interface.
- - Twisted based client for Cassandra.
- - Twisted based client for Redis.
ORM
- Relational Databases
- - The Python SQL Toolkit and Object Relational Mapper.
- - A small, expressive ORM.
- - ORM that provides a generator-oriented interface to SQL.
- NoSQL Databases
- - A Python Object-Document-Mapper for working with MongoDB.
- - Django MongoDB Backend.
- - A Python Library for Simple Models and Containers Persisted in Redis.
- - Object mapper for Amazon DynamoDB.
- Others
- - A Python ORM for Google Drive Spreadsheets.
Computer Vision
- - An open source computer vision framework that gives access to several high-powered computer vision libraries, such as OpenCV. Written in Python and runs on Mac, Windows, and Ubuntu Linux.
- - Python bindings for the VIGRA C++ computer vision library.
Web Frameworks
- - The most popular web framework in Python.
- - A microframework for Python.
- - A fast, simple and lightweight WSGI micro web-framework.
- - A small, fast, down-to-earth, open source Python web framework.
- - A full stack web framework and platform focused on ease of use.
- - A web framework for Python that is as simple as it is powerful.
- - The Web Framework that starts as a microframework and scales up to a full stack solution.
- - A Minimalist Python Web Framework, HTTP/1.1-compliant and WSGI thread-pooled.
- - A framework built on the existing Zope 3 libraries.
- - An open-source web application server, framework and library, formerly known as Zope 3.
- - A lightweight and high performance web framework for Python written in C.
Permissions
- - Implementation of per object permissions for Django 1.2+
- - A tiny but powerful app providing object-level permissions to Django, without requiring a database.
- - Module to align code with thoughts of users and designers. Also magically handles navigation and permissions.
CMS
- - An open source enterprise CMS based on Django.
- - A lightweight yet powerful Django CMS with plugins, inline editing and performance in mind.
- - One of the most advanced Content Management Systems built on Django.
- - A high-level, Pythonic web application framework built on Pyramid.
- - A powerful, consistent, and flexible content management platform.
- - A Django-based CMS for high-traffic magazines, newspaper websites and portals.
- - A CMS built on top of the open source application server Zope.
- - Flexible, extensible, small CMS powered by Flask and MongoDB.
- - A Django content management system.
- - Last CMS framework, based on Django.
E-commerce
- - An open-source e-commerce framework for Django.
- - A Django based shop system.
- - A Django app to accept payments from various payment processors.
- - Money class with optional CLDR-backed locale-aware formatting and an extensible currency exchange solution.
- - Display money format and its filthy currencies.
- - Unofficial Alipay API for Python.
RESTful API
- - A REST framework for Pyramid.
- - A powerful and flexible toolkit that makes it easy to build Web APIs.
- - Creating delicious APIs for Django apps.
- - Create JSON APIs with HMAC authentication and Django form-validation.
- - An implementation of the same web browsable APIs that django-rest-framework provides.
- - An extension for Flask that adds support for quickly building REST APIs.
- - A Flask extension for generating ReSTful APIs for database models defined with SQLAlchemy (or Flask-SQLAlchemy).
- - Flask extension that takes care of API representation and authentication.
- - A high-performance Python framework for building cloud APIs and web app backends.
- - REST API framework powered by Flask, MongoDB and good intentions.
- - Automated REST APIs for existing database-driven systems.
- - Framework agnostic REST framework based on lessons learned from TastyPie.
- - REST API building library (Django and others).
- - A tool for quickly creating REST/HATEOAS/Hypermedia APIs with extensions for Flask and Django.
Authentication
- OAuth
- - Simple but powerful framework agnostic authentication/authorization client package.
- - A generic, spec-compliant, thorough implementation of the OAuth request-signing logic.
- - A Python library for OAuth 1.0/a, 2.0, and Ofly.
- - A fully tested, abstract interface to creating OAuth clients and servers.
- - An easy-to-setup social authentication mechanism.
- - OAuth2 goodies for the Djangonauts.
- - Providing OAuth2 access to Django app.
- - Authentication app for Django that "just works."
- - OAuth 1.0/a, 2.0 implementation of client and provider for Flask.
- - A dead simple OAuth2 client implementation.
- Others
- - Implementation of the JSON Web Token draft 01.
- - Module for generating and verifying JSON Web Tokens.
- - Implementation of JSON Web Signatures draft 02.
Template Engine
- - A modern and designer friendly templating language.
- - Python templating toolkit for generation of web-aware output.
- - Hyperfast and lightweight templating for the Python platform.
- - An HTML/XML template engine. Modeled after ZPT, optimized for speed.
- - A very fast Python template compiler.
Queue
- - An asynchronous task queue/job queue based on distributed message passing.
- - Little multi-threaded task queue.
- - Mr. Queue - A distributed worker task queue in Python using Redis & gevent.
- - Simple job queues for Python.
- - A simple, infinitely scalable, Amazon SQS based queue.
Search
- - Modular search for Django.
- - The official low-level Python client for .
- - The official high-level Python client for Elasticsearch.
- - A Python client for .
- - A fast, pure Python search engine library.
News Feed
- - A library to build newsfeed and notification systems using Cassandra and Redis.
- - Generate generic activity streams from the actions on your site.
Asset Management
- - A Jinja extension to compile and compress your assets.
- - Bundles, optimizes, and manages unique cache-busting URLs for static resources.
- - Packages, optimizes, and serves static file dependencies as Python packages.
- - Monitors changes, processes, and transports assets to CDNs and file storage systems.
- - A collection of custom storage back ends for Django.
- - Glue is a simple command line tool to generate CSS sprites.
- - A Python binding of , the reference implementation of SASS/SCSS.
- - Helps you integrate webassets into your Flask app.
Caching
- - A library for caching and sessions for use with web applications and stand-alone Python scripts and applications.
- - dogpile.cache is a next-generation replacement for Beaker made by the same authors.
- - Python caching library with tag-based invalidation and dogpile effect prevention.
- - Automatic caching and invalidation for Django models through the ORM.
- - A slick ORM cache with automatic granular event-driven invalidation.
- - A caching framework for Django applications.
- - Render template parts with extended cache control.
- - A Python wrapper around the interface.
Email
- - Python SMTP Server for Humans.
- - Python IMAP for Humans.
- - The open source email toolkit.
- - Pythonic SMTP Application Server.
- - An email address and MIME parsing library.
- - High-performance extensible mail delivery framework.
- - Django email back end with AWS SES and Celery.
- - A mail hosting and management platform including a modern and simplified Web UI.
- - Mailing for human beings.
- - Mailjet API implementation for batch mailing, statistics and more.
- - Mailgun library to extract message quotations and signatures.
- - Compose, send and parse emails.
Internationalization
- - An internationalization library for Python.
- - A library for morphology.
URL Manipulation
- - A small Python library that makes manipulating URLs simple.
- - A simple, immutable URL class with a clean API for interrogation and manipulation.
- - A pure Python URL shortening lib.
- - Python implementation for generating Tiny URL and bit.ly-like URLs.
- - A friendly library for parsing HTTP request arguments, with built-in support for popular web frameworks, including Flask, Django, Bottle, Tornado, and Pyramid.
HTML Manipulation
- - Providing Pythonic idioms for iterating, searching, and modifying HTML or XML.
- - A very fast, easy-to-use and versatile library for handling HTML and XML.
- - A standards-compliant library for parsing and serializing HTML documents and fragments.
- - A jQuery-like library for parsing HTML.
- - A CSS library for Python.
- - Implements a XML/HTML/XHTML Markup safe string for Python.
- - A whitelist-based HTML sanitization and text linkification library.
- - Makes working with XML feel like you are working with JSON.
- - HTML/CSS to PDF converter.
- - Converts XML documents to Python objects for easy access.
Web Crawling
- - A fast high-level screen scraping and web crawling framework.
- - Visual scraping for Scrapy.
- - Universal feed parser.
- - A simple, Pythonic library for browsing the web without a standalone web browser.
- - A Python library for automating interaction with websites.
- - Stateful programmatic web browsing.
- - PyQuery-based scraping micro-framework.
- - A distributed crawling framework.
- - A powerful spider system.
- - Site scraping framework.
Web Content Extracting
- - News extraction, article extraction and content curation in Python.
- - Convert HTML to Markdown-formatted text.
- - HTML Content/Article Extractor.
- - Web Content Retrieval for Humans.
- - A small library for extracting rich content from URLs.
- - A module for automatic summarization of text documents and HTML pages.
- - An Extensible Image Crawler.
- - Fast Python port of arc90's readability tool.
- - A Python module to parse the Open Graph Protocol
- - Extract text from any document, Word, PowerPoint, PDFs, etc.
- - Bringing sanity to world of messed-up data.
Forms
- - A flexible forms validation and rendering library.
- - A WTForms extension for JSON data handling.
- - Python HTML form generation library influenced by the formish form generation library.
- - Bootstrap 3 integration with Django.
- - A Django app which lets you create beautiful forms in a very elegant and DRY way.
- - A platform independent Django form serializer.
Data Validation
- - A mappings-validator with a variety of rules, normalization-features and simple customization that uses a pythonic schema-definition.
- - A Python data validation library. It is primarily intended for validating data coming into Python as JSON, YAML, etc.
- - A system for validating and deserializing data obtained via XML, JSON, an HTML form post or any other equally simple data serialization.
- - A library for validating Python data structures.
- - Data Structure Validation.
- - A language for matching/validating/filtering Python dictionaries.
- - Lightweight extensible data validation and adaptation library.
Anti-spam
- - Simple spam blocker for Django.
- - A simple and highly customizable Django app to add captcha images to any Django form.
Tagging
- - Simple tagging for Django.
Admin Panels
- - The admin panel your servers deserve.
- - A jazzy skin for the Django Admin-Interface.
- - Alternative Django Admin-Interface (free only for Non-commercial use).
- - Drop-in replacement of Django admin comes with lots of goodies.
- - Simple and extensible administrative interface framework for Flask.
- - Real-time monitor and web admin for Celery.
Static Site Generator
- - Uses Markdown or ReST for content and Jinja 2 for themes. Supports DVCS, Disqus. AGPL.
- - Static site generator for designers.
- - Jinja2-based static web site generator.
- - A static website and blog generator.
- - Tinkerer is a blogging engine/static website generator powered by Sphinx.
Processes and Threads
- - (Python standard library) Process-based "threading" interface.
- - (Python standard library) Higher-level threading interface.
- - Python Subprocesses for Humans™.
- - A full-fledged subprocess replacement for Python.
- - A wrapper for subprocess.
Concurrency and Networking
- - (Python standard library in Python 3.4+) Asynchronous I/O, event loop, coroutines and tasks.
- - A coroutine-based Python networking library that uses .
- - An event-driven networking engine.
- - A Web framework and asynchronous networking library.
- - Event-driven concurrent framework for Python.
- - Greenlet-based event I/O Framework for Python.
- - Asynchronous framework with WSGI support.
- - A Python wrapper for the 0MQ message library.
- - Twisted based wrapper for the 0MQ message library.
- - Open-source Unified Application Router (Websocket & WAMP for Python on Autobahn).
WebSocket
- - WebSocket & WAMP for Python on Twisted and .
- - WebSocket client and server library for Python 2 and 3 as well as PyPy.
WSGI Servers
- - A project that aims to develop a full stack for building hosting services, written in C.
- - A WSGI utility library for Python that powers Flask and can easily be embedded into your own projects.
- - Multi-threaded, stable, tried and tested.
- - Multi-threaded.
- - Multi-threaded, powers Pyramid.
- - Asynchronous, very fast.
- - Pre-forked, partly written in C.
- - Asynchronous (network side only), written in C.
- - Asynchronous, partly written in C.
- - Asynchronous, very fast and written in C.
RPC Servers
- - (Python standard library) Simple XML-RPC server implementation, single-threaded.
- - This library is an implementation of the JSON-RPC specification.
- - zerorpc is a flexible RPC implementation based on and .
Cryptography
- - The Python Cryptography Toolkit.
- - A Python (2.6+, 3.3+) implementation of the SSHv2 protocol, providing both client and server functionality.
- - A package designed to expose cryptographic primitives and recipes to Python developers.
- - Python binding to the Networking and Cryptography (NaCl) library.
- - Implementation of in Python.
- - Secure password storage/hashing library, very high level.
GUI
- - Python bindings for the cross-platform application and UI framework, with support for both Qt v4 and Qt v5 frameworks.
- - Python bindings for the cross-platform application and UI framework, supporting the Qt v4 framework.
- - A blending of the wxWidgets C++ class library with Python.
- - A library for creating NUI applications, running on Windows, Linux, Mac OS X, Android and iOS.
- - Built-in wrapper for used to create terminal GUI applications.
- - A library for creating terminal GUI applications with strong support for widgets, events, rich colors, etc.
- - A cross-platform windowing and multimedia library for Python.
- - Tkinter is Python's de-facto standard GUI package.
- - Creating beautiful user interfaces with declarative syntax like QML.
- - A Python native, OS native GUI toolkit.
Game Development
- - Pygame is a set of Python modules designed for writing games.
- - cocos2d is a framework for building 2D games, demos, and other graphical/interactive applications. It is based on pyglet.
- - A ctypes based wrapper for the SDL2 library.
- - 3D game engine developed by Disney and maintained by Carnegie Mellon's Entertainment Technology Center. Written in C++, completely wrapped in Python.
- - Python bindings for the Ogre 3D render engine, can be used for games, simulations, anything 3D.
- - Python ctypes bindings for OpenGL and its related APIs.
- - Python bindings for
- - A Visual Novel engine.
Logging
- - (Python standard library) Logging facility for Python.
- - Logging replacement for Python.
- - A realtime logging and aggregation server.
- - The Python client for Sentry.
- - Logging for complex & distributed systems.
Testing
- Testing Frameworks
- - (Python standard library) Unit testing framework.
- - nose extends unittest.
- - A mature full-featured Python testing tool.
- - The definitive testing tool for Python. Born under the banner of BDD.
- - A BDD framework for Python 3.3+. Inspired by C#'s Machine.Specifications.
- - Should style asserts based on .
- - BDD style testing for Python. Inspired by .
- - Hypothesis is an advanced Quickcheck style property based testing library.
- Web Testing
- - Python bindings for WebDriver.
- - Open source tool for testing web applications.
- - Scalable user load testing tool written in Python.
- - A language-agnostic A/B Testing framework.
- Mock
- - A Python Mocking and Patching Library for Testing.
- - A utility library for mocking out the requests Python library.
- - Powerful test doubles framework for Python.
- - Travel through time by mocking the datetime module.
- - HTTP request mock tool for Python.
- - A mocking library for requests for Python 2.6+ and 3.2+.
- Code Coverage
- - Code coverage measurement.
- Fake Data
- - A Python package that generates fake data.
- - Fake database generator.
- - A test fixtures replacement for Python.
- - Another fixtures replacement. Supports Django, Flask, SQLAlchemy, Peewee, etc.
- - Creating random fixtures for testing in Django.
- - Generate random datetime / time.
- Error Handler
- - FuckIt.py uses state-of-the-art technology to make sure your Python code runs whether it has any right to or not.
Code Analysis and Linter
- Code Analysis
- - A type inferencer and indexer for Python.
- - A library that visualises the flow (call graph) of your Python application.
- - Turn your Python and JavaScript code into DOT flowcharts.
- Linter
- - The modular source code checker: pep8, pyflakes and co.
- - Code audit tool for Python and JavaScript.
- - A source code analyzer.
Debugging Tools
- - (Python standard library) The Python Debugger.
- - IPython-enabled pdb.
- - An improbable web debugger through WebSockets.
- - A Platform Independent Python Debugger with GUI, capable of remote debugging based on rpdb2.
- - A full-screen, console-based Python debugger.
- - Debugger capable of attaching to and injecting code into Python processes.
- - Python Client for the server.
- - Monitor Memory usage of Python code.
- - An interactive Python profiler.
- - Display various debug information about the current request/response.
- - A pure-Python library for parsing and analyzing ELF files and DWARF debugging information.
Science and Data Analysis
- - A Python-based ecosystem of open-source software for mathematics, science, and engineering.
- - A fundamental package for scientific computing with Python.
- - Python JIT (just-in-time) compiler to LLVM aimed at scientific Python by the developers of Cython and NumPy.
- - High-productivity software for complex networks.
- - A library providing high-performance, easy-to-use data structures and data analysis tools.
- - Business Intelligence (BI) in Python (Pandas web interface)
- - Markov Chain Monte Carlo sampling toolkit.
- - A Pythonic algorithmic trading library.
- - Short for Python Dynamics, used to assist with workflow in the modeling of dynamic motion based around NumPy, SciPy, IPython, and matplotlib.
- - A Python library for symbolic mathematics.
- - Statistical modeling and econometrics in Python.
- - A community Python library for Astronomy.
- - Data mining, data visualization, analysis and machine learning through visual programming or Python scripting.
- - Cheminformatics and Machine Learning Software.
- - A chemical toolbox designed to speak the many languages of chemical data.
- - A library for parsing and interpreting the results of computational chemistry packages.
- - Biopython is a set of freely available tools for biological computation.
- - Collection of useful code related to biological analysis.
- - A toolkit providing best-practice pipelines for fully automated high throughput sequencing analysis.
- - NumPy and Pandas interface to Big Data.
Data Visualization
- - A Python 2D plotting library.
- - Interactive Web Plotting for Python.
- - Collaborative web plotting for Python and matplotlib.
- - A Python to Vega translator.
- - A plotting library for Python, based on .
- - Same API as ggplot2 for R.
- - Rendering beautiful SVG maps in Python.
- - A Python SVG Charts Creator.
- - Python interface to .
- - Interactive and realtime 2D/3D/Image plotting and science/engineering widgets.
- - High-performance scientific visualization based on OpenGL.
Computer Vision
- - Open Source Computer Vision Library.
- - An open source framework for building computer vision applications.
Machine Learning
- - A Python module for machine learning built on top of SciPy.
- - Web mining module for Python.
- - Numenta Platform for Intelligent Computing.
- - A Machine Learning library based on .
- - GPU-Accelerated Deep Learning Library in Python.
- - Topic Modelling for Humans.
- - Another Python Machine Learning Library.
- - A flexible, fast recommender engine.
- - A Python library for implementing a Recommender System.
- - A lightweight Python wrapper for .
MapReduce
- - The Spark Python API.
- - Python clone of Spark, a MapReduce-like framework in Python.
- - A module that helps you build complex pipelines of batch jobs.
- - Run MapReduce jobs on Hadoop or Amazon Web Services.
- - Python module that allows one to easily write and run Hadoop programs.
- - Run Python code against real-time streams of data. Integrates with .
Functional Programming
- - Functional programming in Python: implementation of missing features to enjoy FP.
- - Fancy and practical functional tools.
- - A collection of functional utilities for iterators, functions, and dictionaries.
- - Cython implementation of Toolz: High performance functional utilities.
Third-party APIs
- - One Python library for all clouds.
- - Python interface to Amazon Web Services.
- - A Python wrapper for the Twitter API.
- - Google APIs Client Library for Python.
- - Google Spreadsheets Python API.
- - Facebook Platform Python SDK.
- - Facepy makes it really easy to interact with Facebook's Graph API
- - A Pythonic interface for Gmail.
DevOps Tools
- - Open source software for building private and public clouds.
- - A radically simple IT automation platform.
- - Infrastructure automation and management system.
- - A simple, Pythonic tool for remote execution and deployment.
- - Tools for writing awesome Fabric files.
- - Chef-like functionality for Fabric.
- - A cross-platform process and system utilities module.
- - Controlling interactive programs in a pseudo-terminal like GNU expect.
- - An easy-to-use provisioning system in Python.
- - A Python port of , a tool for managing Procfile-based applications.
- - Multipurpose task execution tool for distributed systems with web-based interface.
- - Fast, isolated development environments using .
- - Pure-Python API for Mercurial.
- - Pure-Python API for git.
- - Supervisor process control system for UNIX.
Job Scheduler
- - A light but powerful in-process task scheduler that lets you schedule functions.
- - A task runner/build tool.
- - A set of tools to provide lightweight pipelining in Python.
- - Write crontab files in Python like a charm.
- - A powerful workflow engine implemented in pure Python.
- - Python job scheduling for humans.
- - A Python library that helps to make task execution easy, consistent and reliable.
Foreign Function Interface
- - (Python standard library) Foreign Function Interface for Python calling C code.
- - Foreign Function Interface for Python calling C code.
- - Simplified Wrapper and Interface Generator.
- - A Python wrapper for Nvidia's CUDA API.
High Performance
- - Optimizing Static Compiler for Python. Uses type mixins to compile Python into C or C++ modules resulting in large performance gains.
- - An implementation of Python in Python. The interpreter uses black magic to make Python very fast without having to add in additional type information.
- - An enhanced version of Python.
- - A Python implementation built using LLVM and modern JIT techniques with the goal of achieving good performance.
Microsoft Windows
- - Python Extensions for Windows.
- - Python Integration with the .NET Common Language Runtime (CLR).
- - Unofficial Windows binaries for Python extension packages.
- - IDE for the Python language with advanced editing, interactive testing, debugging and introspection features (also comes with Anaconda, WinPython).
- - Scientific-applications-oriented Python Distribution based on Qt and Spyder.
- - Portable development environment for Windows 7/8.
Network Virtualization and SDN
- - A popular network emulator and API written in Python.
- - An open source development platform for Python-based Software Defined Networking (SDN) control applications, such as OpenFlow SDN controllers.
- - A member of the Frenetic family of SDN programming languages that provides powerful abstractions over network switches or emulators.
- - SDN based IXP implementation that leverages Mininet, POX and Pyretic.
Hardware
- - A module for cross-platform control of the mouse and keyboard.
- - A Python library and command line tool for working with WiFi on Linux.
- - A brilliant packet manipulation library.
- - Command line toolkit for working with .
- - Python Robotics.
Compatibility
- - Python 2 and 3 compatibility utilities.
- - The missing compatibility layer between Python 2 and Python 3.
- - Modernizes Python code for eventual Python 3 migration.
Miscellaneous
- - A simple but flexible plugin system for Python.
- - Various helpers to pass trusted data to untrusted environments.
- - A fast Python in-process signal/event dispatching system.
- - A framework for creating and tracking achievements.
Algorithms and Design Patterns
- - A collection of design patterns in Python.
- - A module of algorithms for Python.
Editor Plugins
- Vim
- - An all in one plugin for turning Vim into a Python IDE.
- - Vim bindings for the auto-completion library for Python.
- - Includes -based completion engine for Python
- Emacs
- - Emacs Python Development Environment.
- Sublime Text
- - A Sublime Text plugin to the awesome auto-complete library .
- - Anaconda turns your Sublime Text 3 into a full-featured Python development IDE.
- Atom
- - A static code analysis tool for Atom.
- - An addon to linter, that acts as an interface for flake8.
- - Atom package for virtualenv management.
- - Modular neural network library based on .
- - GPU-Accelerated Deep Learning Library in Python.
- - Another Python Machine Learning Library.
- - A flexible, fast recommender engine.
- - A Python library for implementing a Recommender System.
- - Theano based library for deep and recurrent neural networks
- - Library for approximate unsupervised inference in Bayesian Hidden Markov Models (HMMs) and explicit-duration Hidden semi-Markov Models (HSMMs), focusing on the Bayesian Nonparametric extensions, the HDP-HMM and HDP-HSMM, mostly with weak-limit approximations.
- - A library to let Python program run on Hadoop.
- - Spearmint is a package to perform Bayesian optimization according to the algorithms outlined in the paper: Practical Bayesian Optimization of Machine Learning Algorithms. Jasper Snoek, Hugo Larochelle and Ryan P. Adams. Advances in Neural Information Processing Systems, 2012.
- - Markov Chain Monte Carlo sampling toolkit.
- - A Pythonic algorithmic trading library.
- - A Python library for symbolic mathematics.
- - R binding for eXtreme Gradient Boosting (Tree) Library
- - rgp: R genetic programming framework
- - A framework for fast, parallel, and distributed machine learning algorithms at scale -- Deeplearning, Random forests, GBM, KMeans, PCA, GLM
- - Classification and Regression Training: Unified interface to ~150 ML algorithms in R.
- - caretEnsemble: Framework for fitting multiple caret models as well as creating ensembles of such models.
- - bigRR: Generalized Ridge Regression (with special advantage for p >> n cases)
- - bmrm: Bundle Methods for Regularized Risk Minimization Package
- - Library for calculating great circle distance.
- - Optimization library focused on machine learning, pythonic implementations of gradient descent, LBFGS, rmsprop, adadelta and others
- - Code for Data Science at Olin College, Spring 2014.
- - Code repository for Think Bayes.
- - Code for Allen Downey's book Think Complexity.
- - Text and supporting code for Think OS: A Brief Introduction to Operating Systems.
- - A Python extension module wrapping the full TiMBL C++ programming interface. Timbl is an elaborate k-Nearest Neighbours machine learning toolkit.
- - A library consisting of useful tools for data science and machine learning tasks.
- - Nervana's Python-based Deep Learning framework
- - An optimizing, GPU-meta-programming, code-generating, array-oriented math compiler in Python.
- - Tools for writing, submitting, debugging, and monitoring Storm topologies in pure Python.
- - NumPy and Pandas interface to Big Data.
- - The Python ensemble sampling toolkit for affine-invariant MCMC.
- - A Python Framework for Wind Energy Analysis and Prediction
- - GPU-based high-performance interactive OpenGL 2D/3D data visualization library
- - A web-based visualization and debugging platform for NuPIC.
- - An all-in-one NuPIC Hierarchical Temporal Memory visualization and debugging super-tool!
- - Business Intelligence (BI) in Python (Pandas web interface)
- - Short for Python Dynamics, used to assist with workflow in the modeling of dynamic motion based around NumPy, SciPy, IPython, and matplotlib.
- - Statistical modeling and econometrics in Python.
- - A community Python library for Astronomy.
- - A Python 2D plotting library.
- - Interactive Web Plotting for Python.
- - Collaborative web plotting for Python and matplotlib.
- - A Python to Vega translator.
- - A plotting library for Python, based on .
- - Same API as ggplot2 for R.
- - Rendering beautiful SVG maps in Python.
- - A Python SVG Charts Creator.
- - A pure-python graphics and GUI library built on PyQt4 / PySide and NumPy.
Data analysis
- - A Python-based ecosystem of open-source software for mathematics, science, and engineering.
- - A fundamental package for scientific computing with Python.
- - Python JIT (just-in-time) compiler to LLVM aimed at scientific Python by the developers of Cython and NumPy.
- - High-productivity software for complex networks.
- - A library providing high-performance, easy-to-use data structures and data analysis tools.
Sequence Analysis
- - An object-oriented framework that facilitates the integration of probabilistic models for sequences over a user-defined alphabet.
IDEs
- - Commercial Python IDE based on the IntelliJ platform by JetBrains. Has free community edition available.
- - Commercial polyglot IDE with support for Python.
- - Free polyglot IDE based on Eclipse. Uses PyDev for Python support.
- - Open Source Python IDE.
- - Commercial IDE for Python.
Resources
Websites
- - News about Python.
- - Too many popular Python packages don't support Python 3.
- - Good place to find new Python libraries.
- - List of top 400 projects in GitHub.
- - Sharing cool GitHub projects just got easier!
- - Plain English explanations for every layer of the Python web application stack.
Weekly
Blogs/Podcasts
Data Science / Statistics
Maths
Security Related
Books
Machine-Learning / Data Mining
- - Book + R Code
- - Book + IPython Notebooks
- - Book + Python Code
- - Alex Smola and S.V.N. Vishwanathan
- - Amnon Shashua
Natural Language Processing
Neural Networks
Probability & Statistics
- - Book + Python Code
- - Book and course by MIT
Linear Algebra
Community
Twitter
Lists
Events
Conferences
Meetups
Reposted from: http://tzfwl.baihongyu.com/