Experienced in the field of data engineering, data analytics, and software development, with a strong track record of delivering successful projects and driving growth. Have a reputation for being results-driven, well-organized, and detail-oriented. Areas of expertise also includes advanced machine learning and data visualization.
Have experience in product management and design, as well as training and mentoring product teams. Extensive experience with Amazon Web Services (AWS) and proficient in various technologies, including Cloudera Hadoop, Apache Spark, Python, Java, and more.
As the Chief Technology Officer and Founder of Onescrybe.ai, I lead the company’s technical strategy by integrating strong mathematical foundations with practical, production-level AI engineering. With dual master’s degrees in Mathematics and Computer Science, I combine deep theoretical expertise with hands-on experience in Python, machine learning, and scalable system design. This background enables me to architect robust AI solutions, guide technical innovation, and translate complex computational concepts into clear, strategic business value. My work focuses on building high-performance AI platforms that balance rigorous scientific principles with real-world applicability.
In this role, I was responsible for extracting and analyzing metadata from Kibana components—including dashboards,
visualizations, Lens assets, and saved searches—as well as Elasticsearch indices, policies, and transforms.
I mapped fields across these assets to establish dependency relationships, enabling precise impact assessments
for planned schema or field modifications. I also oversaw daily document-count monitoring to detect anomalies
in data ingestion and ensure proactive alerts for operational continuity.
Additionally, I assessed and implemented large language models (LLMs) to optimize daily workflows within
classified projects. My work focused on three key areas: automating code refactoring, standardizing
documentation practices, and iteratively improving the architectural design of new task pipelines.
These efforts streamlined processes, enhanced consistency, and accelerated the development of
mission-critical solutions.
Developed data tools, algorithms, UI Frameworks to monitor and improve business performance using Srpring/Java Framework, Pythhon/Fastapi. Served as a technical lead on large, complex architecture that involved 30-50 Terrabytes of data in ElasticSearchh. Working across multiple teams in supporting various algorithms to boost performance and handle high volumes of data efficiently.
Devised and executed sustainable, data-driven solutions using cloud and data technologies, creating and deploying end-to-end systems. Conducted code reviews, verified quality control processes, and ensured optimal performance. Orchestrated large-scale Cross Device and Match Testing and Integration (CDMTI) functions, generating an growth of $5M per year. Automated ETL process for select customers, resulting in accelerated CDMTI functions, reduced errors, and improved operational efficiency. Technologies include Cloudera Hadoop, Apache Spark, Hive, Python, D3.js, JavaScript, Jenkins, JIRA Confluence, GIT Repository, Java, Scala, R, Scikit-Learn.
Navigated software engineering product development, overseeing continuous improvement activities and establishing standards and best practices. Designed control system architecture, created user interfaces, and administered key tools. Led a diverse team in synthesizing clinical and FDA data for a MapReduce system, identifying productivity and improvement areas for hospitals. Collaborated with technical specialists to standardize data across its lifecycle through development and governance. Technologies used: Cloudera Hadoop, Hive, Python, D3.js, JavaScript, JIRA Confluence, GIT Repository, Java, Scikit-Learn. Applied Regression, Random Forest, Text Mining, and Natural Language Processing techniques.
Played a critical role in managing a team of engineers in the development of multiple prducts For Provisioning System, targeting the ISP market. Designed and administered a proof of concept for venture capital purposes, successfully securing a first round of $10M funding. Contributed to the recruitment of high-performing engineers. This startup was sold to a third party software company in 2003.
Involved in the early implementation of a Search Engine that was based on a research paper at UCBerkeley.
Interesting technologies in the 'Cloud' that stand-out are many, but to name a few that I am interested are:
import speech_recognition as sr # Create a recognizer object recognizer = sr.Recognizer() # Capture audio from the microphone with sr.Microphone() as source: print("Say something...") audio = recognizer.listen(source) # Perform speech recognition try: text = recognizer.recognize_google(audio) print("You said:", text) except sr.UnknownValueError: print("Sorry, I couldn't understand.") excepti sr.RequestError as e: print("Error fetching results; {0}".format(e))
In this example, the code captures audio from the microphone, processes it using Google's speech recognition service, and then prints the recognized text. However, various other engines and models can be used with the SpeechRecognition library.
While speech recognition has made impressive strides, challenges remain, such as handling accents, noisy environments, and complex sentence structures. Ongoing research focuses on improving accuracy and expanding language support.
As technology evolves, speech recognition is expected to play an integral role in enabling more intuitive human-computer interaction, making devices and applications more accessible and user-friendly for everyone.
importi numpy as np from sklearn.linear_model import LinearRegression import matplotlib.pyplot as plt # Sample data X = np.array([1, 2, 3, 4, 5]).reshape(-1, 1) y = np.array([2, 4, 5, 4, 5]) # Create a linear regression model model = LinearRegression() # Train the model model.fit(X, y) # Make predictions predictions = model.predict(X) # Plot the data and the regression line plt.scatter(X, y, label='Data') plt.plot(X, predictions, color='red', label='Regression Line') plt.xlabel('Input') plt.ylabel('Output') plt.legend() plt.show()

| Name | Album | Singer | Time | Link |
|---|

| Name | Singer(s) | Type | Music Director | Link |
|---|