Siddharth Deshpande
Verified Expert in Engineering
Data Scientist and Developer
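Siddharth is an interdisciplinary researcher whose unique perspective comes from translational projects and his comprehensive educational background in materials engineering, biochemistry, healthcare, natural language processing (NLP), and data science. He has extensive experience working with structured and unstructured biological data and using state-of-the-art AI techniques to solve complex healthcare problems.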
Portfolio
Experience
Availability
Preferred Environment
GPT, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Biomedical Skills, Machine Learning, Language Models, Unstructured Data Analysis, Data Visualization, Artificial Intelligence (AI), Biochemistry, Amazon Web Services (AWS), Python
The most amazing...
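...thing I've developed is an NLP framework that extracts biomedical entities from documents and visualizes them as a network graph to discover new biomedical relations.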
Work Experience
Chief Technological Officer (Interim)
Immersely
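- Worked for Immersely, which enables game developers to create hyper-personalized games that adapt to players' emotions in real time, boosting engagement to create better, more commercially successful games.
- Developed machine learning models that use physiological signals to detect a person's emotions during gameplay, supporting interactive game experiences.
- Developed the company's technical roadmap and back-end technology infrastructure.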
Deep Tech Venture Builder
Post Urban Ventures
- Validated technological feasibility of new startup ideas before funding, built technical prototypes (MVP) for pre-seed and seed round investor pitches, and supported early-stage startups with essential technical infrastructure.
- Served as interim CTO for four startups and as technical advisor for two startups at Post Urban Ventures.
- Contributed to successfully securing £5 million in grant funding for startups.
- Prepared technical pitch decks, offered expert advice and guidance, and helped promote startup success. Designed technical roadmaps for scaling startups after pre-seed and seed rounds.
Senior AI/ML and NLP Chatbot Developer
Richmond Ayirebide
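- Developed an accounting chatbot based on client requirements using ChatGPT, fine-tuned GPT-3, and Telegram.
- Streamlined preprocessing and postprocessing to format the results into easy-to-view Excel sheets.
- Helped plan the chatbot's future deployment on cloud infrastructure.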
Chief Technological Officer (Interim)
Bioleap
- Brought on board to develop the technical framework for Bioleap, a startup focused on developing AI-based single-cell models.
- Managed the building of cloud capabilities in AWS, hired a competent technical team, and improved the current mechanistic models.
- Established multiple strategic technology partnerships with leading biological modeling labs. Built a cloud-based automation strategy for Bioleap models.
- Established a technology strategy (tech stack), technical roadmap, and business plan to support the growth strategy.
NLP Data Scientist
Evaluate Ltd
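- Developed a press release classifier that sorts news articles into 40 technology classes, saving the company around 30,000 pounds per year in third-party API licenses.
- Identified digital health innovations from clinical trials and news articles, and processed documents for a custom analytics project that reduced the hours a Japanese client spent on manual document classification.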
- Created a core NLP framework to extract biomedical entities from unstructured texts and visualize them as a graphical network; the framework became popular for discovering new biomedical relations and was subsequently used in many Evaluate products.
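The entity-network idea in the last bullet can be illustrated with a minimal sketch. This is not the framework built at Evaluate: the `LEXICON` dictionary, the helper names, and the sample sentences are hypothetical, and a simple dictionary lookup stands in for a trained biomedical NER model. Entities that appear in the same sentence are linked, and the edge weight counts how often that happens.

```python
from collections import Counter
from itertools import combinations
import re

# Hypothetical toy lexicon; a real pipeline would use a trained NER model instead.
LEXICON = {
    "aspirin": "DRUG",
    "ibuprofen": "DRUG",
    "cox-1": "PROTEIN",
    "inflammation": "CONDITION",
}


def extract_entities(sentence):
    """Return the sorted lexicon terms found in one sentence."""
    tokens = set(re.findall(r"[a-z0-9\-]+", sentence.lower()))
    return sorted(tokens.intersection(LEXICON))


def build_cooccurrence_graph(sentences):
    """Count how often each pair of entities shares a sentence (edge weights)."""
    edges = Counter()
    for sentence in sentences:
        for pair in combinations(extract_entities(sentence), 2):
            edges[pair] += 1
    return edges


# Illustrative sentences, not taken from any real corpus.
sentences = [
    "Aspirin inhibits COX-1 and reduces inflammation.",
    "Ibuprofen also reduces inflammation.",
    "Aspirin is often compared with ibuprofen for inflammation.",
]

graph = build_cooccurrence_graph(sentences)
for (source, target), weight in graph.most_common():
    print(f"{source} -- {target} (weight {weight})")
```

The resulting weighted edge list can be handed to any graph-drawing library for visualization; heavier edges suggest entity pairs worth a closer look.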
Data Scientist
Patsnap
- Developed PatSnap Bio, a core product that is one of the largest sequence search platforms and is actively used by large pharmaceutical companies.
- Created PatSnap Materials, another core product under Beta testing in China.
- Actively participated in the product development and customer feedback processes for PatSnap Bio and PatSnap Materials.
- Filed five patent applications involving my technology.
Experience
COVID-19 Scientific Journals Analysis
http://github.com/siddharth0112358/coronavirus_19
Research papers available on GitHub:
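• AutoDetect_COVID_FakeNews - classification model for detecting fake news about COVID
• BERT_semantic_search - semantic search that finds similar sentences in the COVID corpus in response to a query
• biorelated_sentence_extracaction_covid - extraction of bio-related sentences from the COVID corpus
• covid_19_topic_modelelling_top2vec - topic modeling of the COVID_19 corpus using Top2Vec
• COVID_explore_drugs - Explore drugs in the COVID corpus
• Covid-19_ques_and_ans - doc2vec-based question answering system for COVID papers
• covid-19_ner_text_summarization_and_topic_modeling - BART summarization, LDA topic modeling, and NER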
• Covid_19_genome_analysis - COVID_19 genome analysis
• Covid_paper_rank_display - NER and retrieval of COVID papers based on topic
• Medical_NER_Corona - NER on coronavirus dataset
• Mining_COVID_keywords - mining keywords using bigrams and trigrams
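As an illustration of the last item, keyword mining with bigrams and trigrams can be sketched as below. This is a minimal sketch under stated assumptions, not the code in the repository: the stopword list, the function names, and the sample titles are hypothetical. Note that stopwords are dropped before n-grams are formed, so words on either side of a stopword become adjacent.

```python
from collections import Counter
import re

# Hypothetical, deliberately tiny stopword list for the example.
STOPWORDS = {"the", "of", "in", "a", "an", "and", "to", "is", "for", "on", "with"}


def tokenize(text):
    """Lowercase the text, split on non-alphanumerics, and drop stopwords."""
    return [t for t in re.findall(r"[a-z0-9]+", text.lower()) if t not in STOPWORDS]


def ngrams(tokens, n):
    """Return all runs of n consecutive tokens, joined by spaces."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def mine_keywords(docs, n, top_k=3):
    """Count n-grams across all documents and return the top_k most frequent."""
    counts = Counter()
    for doc in docs:
        counts.update(ngrams(tokenize(doc), n))
    return counts.most_common(top_k)


# Illustrative titles, not taken from the actual corpus.
docs = [
    "Spike protein binding in the novel coronavirus",
    "Structure of the spike protein binding domain",
    "Spike protein mutations and vaccine response",
]

print(mine_keywords(docs, n=2, top_k=1))
print(mine_keywords(docs, n=3, top_k=1))
```

Raw frequency is the simplest ranking; a scoring scheme such as TF-IDF or pointwise mutual information would be a natural refinement for a larger corpus.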
Alibaba Cloud Global AI Innovation Challenge
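The goal of my project was to analyze the effect of weather on energy generation and demand, and to find a solution that can predict renewable energy generation and energy demand using weather parameters.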
SOLUTION HIGHLIGHTS
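• Solar, wind, and hydro energy generation was predicted using climate and time parameters.
• Energy demand forecasting was done using time and energy parameters (Model 1) and time, energy, and climate parameters (Model 2). Model 2 showed slightly higher accuracy than Model 1. This shows that climate parameters do not affect energy demand as significantly as energy parameters do.
• Energy price forecasting was done using time and energy parameters (Model 1) and time, energy, and climate parameters (Model 2). Model 2 showed higher accuracy than Model 1. It shows that climate parameters affect energy prices significantly.
For all the above cases, 10 million regression algorithms were tested. The ExtraTreeRegressor algorithm performed best and was used to build the regression models.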
URL: http://www.alibabacloud.com/blog/project-showcase-%7C-effect-of-weather-on-energy-generation-and-demand_598252
Conversational Chatbots
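• Conversation Assistant - This bot helps simulate difficult conversations so that clients can practice them beforehand. The client is scored on 2-3 conversation skills, and a report is generated at the end showing their scores and how to improve their conversational skills.
• Fashion Assistant - This bot recommends fashion items based on client needs and business inventory. It uses a combination of GPT-3 and DALL-E.
• Google Bot - This bot has the capabilities of a Google search engine and acts as an advisor/friend you can ask any question. It runs a Google search in the background to provide you with up-to-date answers.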
Bot previews can be shown during interviews.
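The search-in-the-background pattern described for the Google Bot can be sketched roughly as follows. This is an illustrative outline, not the bot's actual code: `answer_with_search`, `build_prompt`, and the two stand-in functions are hypothetical names, and the real search and language-model calls are replaced by injected callables so the example runs without network access or API keys.

```python
from typing import Callable, List


def build_prompt(question: str, snippets: List[str]) -> str:
    """Combine numbered search snippets and the user question into one prompt."""
    context = "\n".join(f"[{i}] {s}" for i, s in enumerate(snippets, start=1))
    return (
        "Answer the question using the search results below.\n"
        f"Search results:\n{context}\n"
        f"Question: {question}\n"
        "Answer:"
    )


def answer_with_search(
    question: str,
    search_fn: Callable[[str], List[str]],
    llm_fn: Callable[[str], str],
    max_results: int = 3,
) -> str:
    """Run a search first, then ask the language model to answer from the results."""
    snippets = search_fn(question)[:max_results]
    return llm_fn(build_prompt(question, snippets))


# Stand-ins for a real search API and a real language model (assumptions for the demo).
def fake_search(query: str) -> List[str]:
    return [f"Result {i} for '{query}'" for i in range(1, 6)]


def fake_llm(prompt: str) -> str:
    used = sum(line.startswith("[") for line in prompt.splitlines())
    return f"reply grounded in {used} search results"


print(answer_with_search("What is new in NLP?", fake_search, fake_llm, max_results=2))
```

Passing the search and model functions in as parameters keeps the orchestration logic testable and makes it easy to swap providers.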
Education
Doctorate in Medicine
National University of Singapore - Singapore
Master's Degree in Materials Science and Engineering
National University of Singapore - Singapore
Bachelor's Degree in Metallurgy and Material Science
College of Engineering Pune - Pune, India
Certifications
Healthcare NLP for Data Scientists
John Snow Labs
Spark NLP for Data Scientists
John Snow Labs
TensorFlow: Advanced Techniques Specialization
DeepLearning.AI | via Coursera
Deep Learning for Healthcare Specialization
University of Illinois at Urbana-Champaign | via Coursera
Customizing Your Models with TensorFlow 2
Imperial College London | via Coursera
Generative Adversarial Networks (GANs) Specialization
DeepLearning.AI | via Coursera
Deployment of Machine Learning Models
Udemy
Natural Language Processing in Python
DataCamp
Natural Language Processing Specialization
DeepLearning.AI | via Coursera
AI in Healthcare Specialization
Stanford University | via Coursera
Deep Learning Specialization
DeepLearning.AI | via Coursera
Skills
Libraries/APIs
TensorFlow, PySpark, Spark ML
Tools
Microsoft Excel, SOLIDWORKS
Industry Expertise
Bioinformatics, Healthcare
Languages
Python, Python 3
Storage
JSON
Platforms
Amazon Web Services (AWS)
Paradigms
Data Science
Other
Natural Language Processing (NLP), Machine Learning, Data Visualization, Biochemistry, Analytics, Biology, Pharmacology, R&D, Engineering, CSV File Processing, Excel Expert, Interactive Charts, Spark NLP, Chatbots, Patents, GPT, Generative Pre-trained Transformers (GPT), Biomedical Skills, Language Models, Unstructured Data Analysis, Artificial Intelligence (AI), Biomaterial, Composite Materials, Deep Learning, Dash, Deep Neural Networks, Convolutional Neural Networks (CNN), Sequence Models, Entrepreneurship, Web Scraping, Time Series Analysis, Computational Biology, Game AI, Emotion Recognition, Chatbot Conversation Design, LangChain, Weaviate, Pinecone, Cell Biology, Materials Science, 3D Printing, Product Development, Model Deployment, Generative Adversarial Networks (GANs), Single-cell Modeling, CTO, Pitch Preparation, Medical Diagnostics, OpenAI, Generative Pre-trained Transformer 3 (GPT-3), Google Custom Search