Ref. code  
C‑2017‑01

Project 1: Live sports narrator

This project aims to create a Live Sports Narrator capable of producing a fluent, human-like, real-time running commentary of a sports event using video along with associated data. Such a system requires substantial technology advances in almost all dimensions of cognitive computing and cloud, including Speech, Vision, Natural Language Processing (NLP), Knowledge & Reasoning (K&R), Machine Learning (ML), Deep Learning (DL), Natural Language Generation (NLG), prediction and forecasting, and real-time processing. We will develop a system that observes live sports events, models game context, detects and understands salient dynamic content, such as the actions of players, and creates a natural-language, intonated speech description that provides real-time running commentary similar to that of human sports commentators. In this project, we expect the intern to help with:

  • Action analysis and movement prediction in real time video using CNNs
  • Detection of objects of interest in video
  • Video captioning

Desired skills

Ph.D. or Master's students majoring in computer science or electronic engineering with the following skills or experience:

  • Computer vision using deep learning neural networks
  • Python/C++ programming
  • Hands-on skills with Caffe or TensorFlow
  • Image processing methods and tools, such as OpenCV
  • Knowledge of Natural Language Processing (NLP) is preferred
C‑2017‑02

Project 2: Neural network on heterogeneous platform

Deep neural networks (DNNs) are compute-intensive workloads, and energy efficiency is another important metric. Several kinds of hardware platform can be used for neural network computation, such as CPU, GPU, FPGA and ARM. This project explores neural network computation on these heterogeneous platforms with different frameworks and programming models. Software stack overhead is also considered, with the goal of finding an optimized platform configuration for running the application. In this project, we expect the intern to help with one or more of the following:

  • Mapping the neural network computation to GPU/FPGA
  • Tuning performance on heterogeneous platforms, especially GPU and/or FPGA
  • Profiling and optimizing the software stack for an end-to-end DNN service

Desired skills

Ph.D. or Master's students majoring in computer science or electronic engineering with the following skills or experience:

  • Knowledge of computer architecture
  • Solid experience in FPGA or GPU development
  • Python/CUDA/C/C++ programming
  • Hands-on skills with Caffe or TensorFlow
  • Experience with languages and tools such as OpenCL, SDAccel or HLS is a plus
C‑2017‑03

Project 3: Blockchain fabric

Blockchain technology is changing the way the world does business. Old business models are being made more efficient, secure and transparent. New business models are emerging. We are looking for interns interested in conducting research on novel blockchain security, distributed computing, and information retrieval algorithms. Areas of interest should include databases, cryptography, distributed & parallel algorithms, language design, trusted computing, and/or secure hardware design. Practical experience designing and implementing cryptographic software and database engines is especially valued.

Desired skills

We are looking for Ph.D. or Master's students majoring in Computer Science, Applied Mathematics (Cryptography), or related areas, with the following desired skills:

  • Distributed systems
  • Database engine design, query compilation and optimization
  • Proficiency with Go is a plus
  • Consensus algorithms such as PBFT or Raft
  • Hands-on experience building software systems
  • Creative, independent, and self-motivated
  • Team collaboration
C‑2017‑04

Project 4: Cognitive internet of things (IoT) analytics platform services

The Internet of Things (IoT) research at IBM Research – China develops world-class technology and cognitive solution innovations that enable enterprises across industries to accelerate their transformation. In this project, we will work together to develop cloud-based cognitive IoT analytics platform services that process and analyze physical sensor data, especially unstructured data such as acoustic, video and text data. These services leverage advanced analytics technologies such as machine learning, graph theory and signal processing to discover business insights that enable industry solution innovations.

Desired skills

Ph.D. or Master's students majoring in Computer Science, Electronic Engineering, Industrial Automation or related areas with the following skills or experience:

  • Acoustic signal processing, voice recognition, feature engineering, machine learning or data mining
  • Graph theory and applications, graph signal processing, multi-modal data fusion (e.g. image/voice collaborative perception) or machine learning
  • Hands-on experience in building software systems and algorithm optimization
  • Experience with deep learning frameworks such as Caffe, TensorFlow or Theano is preferred
  • Experience with real-world use cases is preferred
  • Creative, independent and self-motivated
C‑2017‑05

Project 5: Cognitive cloud/IT operations analytics

Cognitive Cloud/IT Operations Analytics aims to mine actionable insights, using Big Data and advanced analytics techniques, from huge volumes of machine-generated data (e.g. logs, events, configurations, metrics, system changes) and service management data (e.g. tickets, events, change requests, fixing scripts, FAQs, forum posts, emails) within large-scale, highly distributed environments (e.g. cloud, internet/enterprise applications). The goal is to facilitate proactive anomaly detection and faster troubleshooting on the way to intelligent cloud operations. More specifically, we expect an intern to:

  • Develop innovative models and algorithms for anomaly detection and root cause analysis by correlating multiple types of operations data
  • Generate anomaly patterns and remediation actions with machine learning techniques
  • Analyze unstructured service management data to generate knowledge for automatic problem determination and self-healing

Desired skills

Ph.D. or Master's students majoring in computer science or related areas with the following skills or experience:

  • Distributed systems and cloud computing
  • Big data analytics with Hadoop, Spark, etc.
  • Data mining, machine learning and deep learning
  • Hands-on experience in building software systems with Python, Java or Scala and learning packages, e.g. scikit-learn, Spark ML, TensorFlow
  • Creative, independent and self-motivated
  • Team collaboration
C-2017-06

Project 6: Cognitive data curation

In the cognitive era, the curation of data has become more prominent, particularly for software that processes high-volume, complex data systems. However, most curation processes require human intervention and a great deal of manual work, which makes curation inefficient and delays value creation through data analytics. This project aims to automate the curation processes as fully as possible through intelligent data discovery, transformation, matching, lineage tracking, etc. across the entire data life-cycle.

Desired skills

Ph.D. or Master's students majoring in computer science or related areas with the following skills or experience:

  • Data understanding and ETL
  • Big data analytics with Hadoop, Spark, etc.
  • Data mining and machine learning / deep learning
  • Hands-on experience with Bash scripting and Linux
  • Creative, independent and self-motivated
  • Team collaboration
C-2017-07

Project 7: Multimodal interaction

At IBM Research – China, we are working on multimodal interaction technologies, including speech recognition, speech synthesis, natural language processing, question answering, and dialogue systems. Our technologies have been applied in many applications and solutions to solve real-world client problems.

Desired skills

We are looking for Ph.D. or Master's students in Computer Science, Electronics Engineering, Linguistics, Information Science, Applied Mathematics, or related areas, with the following qualifications to strengthen our team:

  • Experience with machine learning, deep learning, data mining, statistical modeling
  • Experience with at least one of the following topics: speech recognition, speech synthesis, natural language processing, question answering, or dialogue systems
  • Strong programming skills in at least one of the following languages: C, C++, Java, Python
  • Experience with deep learning toolkits, such as TensorFlow, Caffe, MXNet or Theano, is a plus
C-2017-08

Project 8: Artificial nose project

Location: Beijing / Shanghai, China

This is a far-reaching research topic: building an artificial nose system to “smell” the world. It comprises two parts: (1) a hardware part, which requires building and testing a module or device suitable for detecting certain chemical compositions; and (2) a software part, which uses machine learning to establish the “smell” fingerprint. The primary application area for this research is environmental monitoring, but it is not limited to this.

Desired skills

This position requires passion, creativity, and industry insight to perform independent research, as well as the ability to solve real-world problems. Successful candidates should have a strong academic background, leadership, and excellent communication skills. We are looking for candidates with a Ph.D. or Master's degree in Materials Science, Electrical Engineering, Statistics, Artificial Intelligence, Machine Learning, Computer Science or related areas with the following qualifications:

  • Experience in data analytics, optimization, simulation, or related areas, with strong publications.
  • Excellent programming and system development skills (Java, C/C++, R, Matlab).
  • Self-motivated and responsible, with good teamwork and communication skills.
  • Industry experience, especially in energy and environment, is a plus.
  • Experience with data analytics tools, big data analytics or cloud computing is a plus.
C-2017-09

Project 9: Machine learning / Environmental analytics topics

Location: Beijing, China

Data is the world’s new natural resource and the basis of competitive advantage. Innovative industry solutions and services based on big data analytics & optimization will become the next frontier of industry transformation. The Industries and Solutions department of IBM Research – China is one of the fastest growing groups applying advanced business analytics & optimization technology to solve real-world industry challenges. Our mission is to become a world-class research organization creating innovative solutions and services for industries.

The Industries and Solutions department aims to establish industry thought leadership and innovative solutions through the synergy of advanced technologies and deep industry insights. It focuses on advanced analytics, mathematical optimization and data mining.

  1. The focus industries include, but are not limited to:
    • Green Horizon Program (one of the highlight programs at IBM Research – China).
      [ JDCom | PVTech ]
    • Big data analysis in environment, renewable energy and electric power to solve real-world problems for customers.
  2. Deep research in big data analysis, artificial intelligence, statistics, data mining, machine learning, spatial analysis and optimization algorithms.

Desired skills

This position requires passion, creativity, and industry insight to perform independent research and solve real-world client problems. Successful candidates should have strong analytical and leadership skills and excellent communication skills. We are looking for candidates with a Ph.D. or Master's degree in Operations Research, Statistics, Artificial Intelligence, Machine Learning, Computer Science or related areas with the following qualifications:

  • Experience in data analytics, optimization, simulation, or related areas, with strong publications.
  • Excellent programming and system development skills (Java, C/C++, R, Matlab).
  • Self-motivated and responsible, with good teamwork and communication skills.
  • Industry experience, especially in energy and environment, is a plus.
  • Experience with data analytics tools, big data analytics or cloud computing is a plus.
C-2017-10

Project 10: Cognitive healthcare

This project conducts research on developing big data analytics and cognitive computing methods for healthcare data, building data mining and machine learning models on real-world clinical/behavioral/genomic data, and evaluating and improving model performance.

Desired skills

We are looking for Ph.D. or Master's students majoring in Medical Informatics, Information Science, Computer Science, Statistics, Applied Mathematics, or related areas, with the following desired skills:

  • Strong in machine learning, data mining and statistics.
  • Strong in software development, with proficiency in at least one advanced programming language, such as Python or Java.
  • Familiar with data analysis tools and libraries, such as SPSS, Weka, R and/or scikit-learn.
  • Knowledge of clinical/genomic/behavioral data is a plus.
  • Familiarity with big data platforms and tools, such as HDFS, Hadoop, Mahout and Spark, is a plus.
C-2017-11

Project 11: Cognitive internet of things (IoT) industry solutions

The Internet of Things (IoT) research at IBM Research – China develops world-class technology and cognitive solution innovations that enable enterprises across industries to accelerate their transformation. In this project, we will work together to develop cloud and mobile innovations that effectively process and analyze physical sensor data, correlated with contextual information (such as weather, maps and knowledge bases) and system-of-record data (such as quality records, accident claims and blockchain transactions). The aim is to develop cognitive IoT industry solution innovations focusing on connected manufacturing (Industry 4.0) operations, connected insurance, connected healthcare, connected vehicles and connected machinery. We expect the intern to:

  • Design and develop innovative machine learning algorithms and data analysis models to discover insights from multi-source IoT sensor data (visual, acoustic, vibration, cardiograph, GPS, gyroscope, etc.)
  • Design and develop contextual knowledge graph models and knowledge learning algorithms leveraging cognitive analytics (Machine Learning, Deep Learning, Natural Language Processing, etc.)
  • Author and publish high-quality research papers and patents; design and integrate with mobile applications and solutions.

Desired skills

Ph.D. or Master's students majoring in computer science, electronic engineering, industrial automation, civil engineering or related areas with the following skills or experience:

  • Data mining and machine learning skills.
  • Hands-on experience in building software systems (e.g. mobile, web).
  • Creative, independent and self-motivated, with good team collaboration.
  • Experience in industry oriented research and development is a plus